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PRELIMINARY AMENDMENT 



For: METHODS AND COMPOSITIONS FOR 
REGULATED TRANSCRIPTION AND 
EXPRESSION OF HETEROLOGOUS 
GENES 



BOX PATENT APPLICATION 
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Washington, D.C. 20231 



Applicant is submitting herewith a Preliminary Amendment in the above-referenced 
patent application. Prior to examination of the application, the Examiner is respectfully 
requested to enter the following amendments. 

In the Specification 

At page 1, line 6, after "This application is a" insert --continuation of U.S. S.N. 



Sir: 
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In the Claims 

17. (Amended) A method for obtaining a plant having a modified phenotype, said 
method comprising: 

transforming a host plant cell with a DNA construct under genomic integration 
conditions, wherein said construct comprises as operably linked components in the direction of 
transcription, a promoter region obtainable from a gene, wherein transcription of said gene is 
preferentially regulated in plant seed tissue, a DNA sequence of interest other than the native 
coding sequence of said gene, and transcription termination region, wherein said components are 
functional in a plant cell, 

whereby said DNA construct becomes integrated into a genome of said plant cell, 

regenerating a plant from said transformed plant cell, and 

growing said plant under conditions whereby said DNA sequence of interest is expressed 
and a plant having said modified phenotype is obtained. 

1 8. (Amended) A method of altering the phenotype of plant seed tissue as distinct from 
other plant tissue, said method comprising: 

growing a plant, wherein said plant comprises cells containing a DNA construct 
integrated into their genome, said DNA construct comprising, in the 5 r to 3' direction of 
transcription, a transcriptional initiation region from a gene, wherein transcription of said gene is 
preferentially regulated in a plant seed tissue, a DNA sequence of interest other than the coding 
sequence native to said transcriptional initiation region, and a transcriptional termination region, 
whereby said DNA sequence of interest is transcribed under transcriptional control of said 
transcriptional initiation region and a plant having an altered phenotype is obtained. 

19. (Reiterated) The method according to Claim 17 or 18, wherein said DNA construct 
is flanked by T-DNA. 

20. (Reiterated) The method according to Claim 19, wherein said plant is soybean or 

Z 



rapeseed plant 



21. (Reiterated) The method according to Claim 17 or 18 wherein said DNA sequence of 
interest encodes an enzyme. 

22. (Reiterated) The method according to Claim 17 or 18 wherein said DNA sequence of 
interest is an antisense sequence. 

23. (Reiterated) The method according to Claim 17 or 18 wherein said gene is 
transcribed during seed embryogenesis. 

24. (Reiterated) The method according to Claim 23 wherein said gene is transcribed from 
about day 7 to day 40 postanthesis. 

25. (Reiterated) The method according to Claim 17 or 18 wherein said gene is 
transcribed during seed maturation. 

26. (Reiterated) The method according to Claim 25 wherein said gene is transcribed 
about day 1 1 to day 30 postanthesis. 

27. (Reiterated) The method according to Claim 18, wherein said transcriptional 
initiation region further comprises a translational initiation region and said DNA sequence of 
interest is an open reading frame encoding an amino acid sequence. 

28. (Reiterated) A method for modifying a genotype of a plant to impart a desired 
characteristic to seed as distinct from other plant tissue, said method comprising: 

transforming under genomic integration conditions, a host plant cell with a DNA 
construct comprising in the 5' to 3' direction of transcription, a transcriptional initiation region 
from a gene, wherein said transcription of said gene is regulated in a plant seed tissue, a DNA 
sequence of interest other than the native coding sequence of said gene, and a transcriptional 
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termination region, whereby said DNA construct becomes integrated into the genome of said 
plant cell; 

regenerating a plant from said transformed host cell; and 
growing said plant to produce seed having a modified genotype. 

29. (Reiterated) The method according to Claim 28, wherein said DNA construct is 
flanked by T-DNA. 

30. (Reiterated) The method according to Claim 28, wherein said plant is a Brassica 

plant. 

3 1 . (Reiterated) The method according to Claim 28, wherein said DNA sequence of 
interest encodes an enzyme. 

32. (Reiterated) The method according to Claim 28, wherein said DNA sequence of 
interest is an antisense sequence. 

33. (Reiterated) The method according to Claim 28, wherein said plant is a soybean 

plant. 

34. (Reiterated) A method for modifying transcription in seed tissue as distinct from 
other plant tissue, said method comprising: 

growing a plant capable of developing seed tissue under conditions to produce seed, 
wherein said plant comprises cells containing a DNA construct integrated into their genome, said 
DNA construct comprising, in the 5' to 3' direction of transcription, a seed-specific 
transcriptional initiation region, a DNA sequence of interest other than the coding sequence 
native to said transcriptional initiation region, and a transcriptional termination region, whereby 
said DNA sequence of interest is transcribed under transcriptional control of said seed-specific 
transcriptional initiation region. 



35. (Reiterated) The method according to Claim 34, wherein said DNA sequence of 
interest is an antisense sequence. 

36. The method according to Claim 34, wherein said plant is of the genus Brassica. 

37. (Reiterated) The method according to Claim 34, wherein said transcriptional 
initiation region further comprises a translational initiation region and said DNA sequence of 
interest is an open reading frame encoding an amino acid sequence. 

38. (Reiterated) The method according to Claim 34, wherein said plant is a soybean 

plant. 

39. (Reiterated) A method to selectively express a heterologous DNA sequence of 
interest in seed tissue as distinct from other plant tissue, said method comprising: 

growing a plant capable of developing a seed tissue under conditions to produce seed, 
wherein said plant comprises cells having a genomically integrated DNA construct comprising, 
as operably linked components in the 5' to 3' direction of transcription, a seed- specific 
transcriptional initiation region and a translational initiation region, a DNA sequence of interest 
other than the coding sequence native to said transcriptional initiation region, a transcriptional 
termination region downstream of said DNA sequence of interest, whereby said DNA sequence 
of interest is expressed under control of said seed-specific transcriptional and translational 
initiation region. 

40. (Reiterated) The method according to Claim 39, wherein said plant is of the genus 
Brassica, 

41 . (Reiterated) The method according to Claim 39, wherein said plant is a soybean 

plant. 

Add the following new claims: 
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42. (New) A method according to Claim 17 or Claim 18, wherein said gene is selected 
from the group consisting of a napin gene, ACP gene, cruciferin gene, or EA9 gene. 

43. (New) The method according to Claim 17 or Claim 18, wherein said DNA sequence 
of interest is a structural gene. 

44. (New) The method according to Claim 17 or Claim 18, wherein said DNA sequence 
of interest is an open reading frame encoding an amino acid sequence. 

45. (New) The method according to Claim 17 or Claim 18, wherein said transcriptional 
initiation region further comprises a translational initiation region. 

46. (New) A method for modifying transcription in plant seed tissue as distinct from 
other plant tissue, said method comprising growing a plant wherein said plant comprises cells 
containing a DNA construct integrated into their genome, said construct comprising: 

regulatory region from a gene wherein said gene is expressed in plant seed tissue, a DNA 
sequence of interest other than the coding sequence native to said regulatory regions, whereby 
said DNA sequence of interest is expressed under control of said regulatory regions. 

47. (New) The method according to Claim 47, wherein said regulatory region comprise 
transcriptional and translational initiation and termination regions. 

48. (New) A method to selectively express a heterologous DNA sequence of interest in 
plant seed tissue as distinct from other plant tissue, said method comprising: 

growing a plant wherein said plant comprises cells containing a DNA construct integrated 
into their genome, said construct comprising a tissue specific expression cassette and a DNA 
sequence of interest, wherein said DNA sequence of interest is expressed under the control of 
said tissue specific expression cassette whereby said DNA sequence of interest is expressed in 
plant seed tissue. 
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REMARKS 



No new matter is introduced by these amendments and the Examiner is respectfully 
requested to enter them. 



Rae- Venter Law Group, P.C 
P. O. Box 60039 
Palo Alto, CA 94306 
Telephone: (650) 328-4400 
Facsimile: (650)328-4477 

BRV:amp 



Respectfully submitted, 





Barbara Rae- Venter, Ph.D. 
Reg. No. 32,750 
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METHODS AND COMPOSITIONS FOR REGULATED TRANSCRIPTION 
AND EXPRESSION OF HETEROLOGOUS GENES 

5 

This application is a continuation of U.S. S.N. 
08/484,941, filed June 7, 1995, which is a continuation of 
U.S. S.N. 08/105,852, filed 8/10/93, pending; U.S. S.N. 
08/105,852 is a continuation in part of 07/526,123, filed 

10 5/21/90, pending, which is a continuation of 07/267,865, 
filed 11/2/88, abandoned, which is a continuation of 
06/692,605, filed 1/17/85, abandoned; U.S. S.N. 08/105,852, is 
also a continuation in part of 07/582,241, filed 9/14/90, 
abandoned, which is a continuation of 07/188,361, filed 

15 4/29/88, abandoned, which is a continuation in part of 

07/168,190, filed 3/15/88, abandoned, which is a continuation 
in part of 07/054,369, filed 5/26/87, which issued on 7/24/90 
as patent number 4,943,674; U.S. S.N. 08/105,852 is also a 
continuation in part of U.S. S.N. 07/742,834, August 8, 1991, 

20 which issued as U.S. Patent No. 5,420,034 issued on 5/30/95, 
which is a continuation in part of 07/550,804, filed 7/9/90, 
abandoned, which is a continuation in part of 07/147,781, 
filed 1/25/88, abandoned, which is a continuation in part of 
07/078,538, filed 7/28/87, abandoned, which is a continuation 

25 in part of 06/891,529, filed 7/31/86, which is abandoned. 

INTRODUCTION 

Technical Field 

30 

This invention relates to regulated genetic modification 
of plant material, particularly for tissue and/or 
developmental specific trascription and expression. 
Heterologous constructs are provided whereby production of 



-1- 



endogenous products can be modulated or new capabilities 
provided ♦ 

Background 

While the ability to manipulate bacterial and 
5 mammalian cells by hybrid DNA technology has been available 
for almost a decade, only in 1983 was it first reported that 
successful expression of an exogenous gene was achieved in a 
plant cell. Plants have a highly complex genome and differ 
in numerous ways from both bacterial and mammalian genes. 

10 Therefore, while as a first approximation, one may 

extrapolate from the experience with other species, the 
relevance of such experience must be determined by 
experimentation. In general, genetic engineering techniques 
have been directed to modifying the phenotype of individual 

15 prokaryotic and eukaryotic cells, especially in culture. 
Plant cells have proven more intransigent than other 
eukaryotic cells due not only to the lack of suitable vector 
systems but also a result of the different goals involved. 
Plant genetic engineering has for the most part been 

2 0 directed to modifying the entire plant or a particular 
tissue rather than modifying a single cell in culture. 

In order to be able to successfully modify plant 
cells, it will be necessary to develop a large number of 
different systems for introducing the exogenous DNA into the 

25 plant cell, for directing, as appropriate, the introduced 
DNA either randomly or to particular genomic sites, to 
provide for constitutive or regulated expression and, as 
appropriate, to provide for transport of the product to an 
appropriate site. Toward this end, it will be necessary to 

30 develop a wide variety of regulatory signals involved with 
replication, transcription, translation, integration, and 
the like. To varying degrees these regulatory signals will 
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have general application across species or be species- 
specific , will be associated with specific stages of plant 
growth, or be subject to external control* To that extent, 
it will be necessary to develop a wide spectrum of 
regulatory sequences to allow for expression under 
predetermined conditions . 

For many applications, it will be desirable to 
provide for transcription in a particular plant tissue 
and/or at a particular time in the growth cycle of the plant 
or maturation cycle of the tissue. Toward this end, there 
is substantial interest in identifying endogenous plant 
products transcription or expression of which is regulated 
in a manner of interest. In identifying such products, one 
must first look for a product which appears at a particular 
time in the cell growth cycle or in a particular plant 
tissue, demonstrate its absence at other times or in other 
tissue, identify nucleic acid sequences associated with the 
product and then identify the sequence in the genome of the 
plant in order to obtain the 5 '-untranslated sequence 
associated with transcription. Identifying the particular 
sequence, followed by establishing that it is the correct 
sequence and isolating the desired transcriptional 
regulatory region requires an enormous outlay in time and 
effort- One must then prepare appropriate constructs, and 
demonstrate that the constructs are efficacious in the 
desired manner. 

There has been substantial interest in modifying the 
seed with transcriptional initiation regions to afford 
transcription and expression of the gene introduced into the 
seed, rather than constitutive expression which would result 
in expression throughout the plant* Also of interest is the 
ability to change the phenotype of fruit, so as to provide 
fruit which will have improved aspects for storage, 



handling, cooking, organoleptic properties, freezing, 
nutritional value, and the like* 

In addition, different systems may be required for 
the introduction of nucleic acid into plant cells to obtain 
reasonable efficiencies of transformation and functioning of 
the nucleic acid. In many instances, such as the tumor 
inducing plasmids and viruses, the vectors have found 
limited utilization in their range of hosts* Therefore, 
different transformation and replication systems may be 
required for different plant species. 

Relevant Literature 

Lack of transformation by AgroJbacterium of soybean 
is reported by DeCleene and DeLey, The Botanical Review 
(1976) 42:389-446, Encouraging results in the 
transformation (Pederson et al., Plant Cell Repts. (1983) 
2:201-204 and Hood et all, Bio /Technology (1984) 2:702-708) 
and regeneration (Christianson et al., Science (1983) 222: 
632-634) of soybean have recently been reported* A light 
inducible soybean SSU gene (small subunit SSU) of ribulose- 
1, 5-bisphosphate-carboxylase (RuSP-carboxylase) is reported 
by Berry-Lowe et al . , J m Mol . Appln. Gen. (1982) 2:483-4 98. 
Sequences 5' to the pSSU gene were recently shown to direct 
foreign gene expression in a light-inducible manner when 
transferred into tobacco callus (Herrera-Estrella et al.. 
Nature (.1984) 310:115-120). 

Crouch et al . , In: Molecular Form and Function of 
the Plant Genome, eds. van VI ot en-Doting, Groot and Hall, 
Plenum Publishing Corp, 1985, pp 555-566; Crouch and Sussex, 
Planta (1981) 153:64-74; Crouch et al., <7* Mol. Appl. Genet. 
(1983) 2:273-283; Simon et al . , Plant Molecular Biology 
(1985) 5:191-201; and Scofield and Crouch, J. Biol. Chem. 
(1987) 262:12202-12208, describe various aspects of Brassica 



napus storage proteins- Rose et al., Nucl . Acids Res. 
(1987) 15:7197 and Scherer and Knauf, Plant Mol. Biol. 
(1987) 9:127-134 describe ACP genes* Beachy et al., ZMBO J. 
(1985) 4:3047-3053; Sengupta-Gopalan et al., Proc. Natl. 
5 Acad. Sci. C7SA (1985) 82:3320-3324; Greenwood and 

Chrispeels, Plant Physiol. (1985) 79:65-71 and Chen et al., 
Proc. Natl. Acad. Sci. tfSA (1986) 83:8560-8564 describe 
studies concerned with seed storage proteins and genetic 
manipulation. Eckes et al., Mol. Gen. Genet. (1986) 205:14- 

10 22 and Fluhr et al., Science (1986) 232:1106-1112 describe 
the genetic manipulation of light inducible plant genes. 

cDNA clones from tomato displaying differential 
expression during fruit development have been isolated and 
characterized (Mansson et al . , Mol. Gen. Genet. (1985) 

15 200:356-361; Slater et al . , Plant Mol. Biol. (1985) 5:137- 
147) . The studies have focused primarily on mRNAS which 
accumulate during fruit ripening. One of the proteins 
encoded by the ripening-specific cDNAs has been identified 
as polygalacturonase (Slater et al., Plant Mol. Biol. (1985) 

20 5:137-147). A cDNA clone which encodes tomato 

polygalacturonase has been sequenced. Grierson et al . , 
Nucleic Acids Research (1986) 14:8395-8603. The 
concentration of polygalacturonase mRNA increases 2000-fold 
between the immature-green and red-ripe stages of fruit 

25 development. This suggests that expression of the enzyme is 
regulated by the specific mRNA concentration which in turn 
is regulated by an increase in transcription. Delia Penna 
et al., Proc. Natl. Acad. Sci. USA (1986) 83:6420-6424. 
Mature plastid mRNA for psbA (one of the components of 

30 photosystem II) reaches its highest level late in fruit 

development, whereas after the onset of ripening, plastid 
mRNAS for other components of photosystem I and II decline 
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to nondetectable levels in chromoplasts. Piechulla et al . , 
Plant Mol. Biol. (1986) 7:367-376. 

Summary of the Invention 
Novel methods and DNA constructs are provided for 
5 transforming plants employing T-DNA and a Ti- or Ri-plasmid 
for heterologous DNA introduction and integration into the 
plant genome* Transformation without gall formation of 
plant cells which have historically not been Agrojbacteriuizz 
hosts is achieved with successful expression of the 

10 heterologous DNA. Additionally, DNA constructs are provided 
which are employed in manipulating plant cells to provide 
for regulated transcription, such as light inducible 
transcription, in a plant tissue or plant part of interest 
at particular stages of plant growth or in response to 

15 external control. Particularly, transcriptional regions 
from seed storage proteins, seed coat proteins or acyl 
carrier protein are joined to other than the homologous gene 
and introduced into a plant cell host for integration into 
the genome to provide for seed-specific transcription. The 

20 constructs provide for modulation of expression of 

endogenous products as well as production of exogenous 
products in the seed. Novel DNA constructions also are 
provided employing a fruit-specific promoter, particularly a 
promoter from a gene active beginning at or shortly after 

25 anthesis or beginning at the breaker stage, joined to a DNA 
sequence of interest and a transcriptional termination 
region. A DNA construct may be introduced into a plant cell 
host for integration into the genome and transcription 
regulated at a time at or subsequent to anthesis. In this 

3 0 manner, high levels of RNA and, as appropriate, 

polypeptides, may be achieved during formation and/or 
ripening of fruit. 
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Brief Description of the Drawings 
Figure 1 is a partial sequence of the promoter 
region of the XBnNa napin gene. The start (ATG) of the open 
reading frame is underlined* 
5 Figure 2 is a restriction map of cloned XCGN1-2 

showing the entire coding region sequence as well as 
extensive 5' upstream and 3' downstream sequences. 

Figure 3 is a partial nucleotide sequence of genomic 
ACP clone Bcg4-4. The coding region is indicated by the 
10 three-letter amino acid codes. Breaks in the coding region 
sequence represent introns. The underlined nucleotide at 
position 310 is ambiguous without further sequence analysis 
for confirmation. 

Figure 4 is the complete nucleotide sequence of J3. 
15 campestris cDNA EA9 . The longest open reading frame is 

designated by the three letter amino acid code. PolyA tails 
are evident at the end of the sequence and a potential 
polyadenylation signal is underlined. 

Figure 5 shows the nucleotide sequence of the cDNA 
20 clones PCGN1299 (2A11) and PCGN1298 (3H11) ♦ The amino acid 
sequence of the polypeptide encoded by the open reading 
frame is also indicated. 

Figure 6 is a comparison of 2A11 to pea storage 
proteins and other abundant storage proteins: 
25 (a) 2A11 (residues 3 3-46) is compared to PAlb and 

the reactive site sequences of some protease inhibitors, 
Palb (residues 6-23) , chick pea inhibitor (residues 11-23) , 
lima bean inhibitor (residues 23-35) , human a 1- antitrypsin 
reactive site peptide. The arrow indicates the reactive 
30 site. 

(b) is a comparison of the amino terminal sequence 
of 2A11 with the amino termini of a range of seed proteins. 
The data have been modified or deletions introduced to 



maximize homology; conserved residues are shown boxed* The 
sequences are from the following sources: PAlb; barley 
chlorof orm/methanol-soluble protein d; wheat albumin; wheat 
a-amylase inhibitor 0.28; millet bi-f unctional inhibitor; 
castor bean 2S small subunit; and napin small subunit. 

Figure 7 shows the complete sequence of the 2 All 
genomic DNA cloned into PCGN1273 from the Xhol site 
(position 1 at the 5' end) to the EcoR I site (position 
4654) • 

Figure 8 shows the nucleotide sequence of a 
polygalacturonase (PG) genomic clone* 

Figure 9 shows 2A11 genomic constructs. The upper 
line shows a map of the 2A11 genomic clone. The 
transcriptional start site, the polyadenylation site, the 
start (ATG) and stop (TGA) sites and the position of the 
intron are indicated. The hatched region indicates the 
portion of the genomic clone that was used to make the 
tagged 2 All constructions. The bottom portion shows the 
regions used to construct the 2A11 cassettes including the 
synthetic oligonucleotide used to insert restriction sites 
and reconstruct the 3' end. 

Figure 10 shows examples of 2A11 cassettes. Four 
versions of the 2A11 cassette are shown. They differ only 
in the flanking poly-linker regions and in the antibiotic 
resistance marker on the plasmid. 

Description of the Preferred Embodiments 
In accordance with the subject invention, DNA 
constructs are provided which allow for regulated 
modification of plant phenotype for example during fruit 
development and ripening, in specific plant structures 
derived from the ovum, and in chloroplast containing plant 
tissues such as leaves. The DNA constructs comprise a 



regulated transcriptional initiation region. Downstream 
from the regulated transcriptional initiation region will be 
a sequence of interest which will provide for regulated 
modification of plant phenotype, by modulating the 
5 production of an endogenous product, as to amount, relative 
distribution , or the like, or production of an exogenous 
expression product to provide for a novel function or 
product* Thus genes of interest as a source of regulated 
transcriptional initiation regions include those genes 

10 associated with seed formation, preferably in association 

with embryogenesis and seed maturation and those associated 
with fruit maturation and ripening, fruit rotting and light- 
induced processes in chloroplasts- The transcriptional 
cassette will include in the 5 7 -3 7 direction of 

15 transcription, a regulated transcriptional and translational 
initiation region, a sequence of interest, and a 
transcriptional and translational termination region 
functional in plants. One or more introns may be also 
present* 

2 0 In addition to the transcription construct, 

depending upon the manner of introduction of the 
transcription construct into the plant, other DNA sequences 
may be required. The subject invention includes a novel 
method provided for the introduction of foreign DNA 

2 5 employing T-DNA from an Agrobacterium plasmid, where 

efficient functional introduction of heterologous DNA is 
achieved in plants normally considered outside the 
Agrobacterium range, e*g,, monocotyledons and leguminous 
dicotyledons, without gall formation. The method can also 

3 0 be used with the known dicotyledon hosts of Agrobacterium, 

DNA constructs are made which can be inserted into an 
Agrobacterium plasmid for transfer to a plant host. Plant 
hosts of particular interest are the grains and legumes. 
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When using the Ti- or Ri-plasmid for transformation 
of plant cells, as described below, at least the right 
border and frequently both the right and left borders of the 
T-DNA of the Ti- or Ri-plasaids will be joined as flanking 
regions to the transcription construct. The use of T-DNA as 
a flanking region in a construct for integration into a Ti- 
or Ri- plasmid has been described in EPO Application 
No, 116,718 and PCT Application Nos. WO84/02913, 02919 and 
02920. See also Herrera-Estrella, Nature (1983) 303:209- 
213; Fraley et al., Proc. Natl • Acad, Sci, USA (1983) 
80:4803-4807; Horsch et al . , Science (1984) 223:496-498; and 
DeBlock et al., EMBO J. (1984) 3:1681-1689. Various 
fragments may be employed in the constructions to provide 
for homology with the T-DNA of the tumor plasmids. The 
homology may involve structural genes, promoter regions , 
other untranslated regions such as border regions, or the 

or 

like. 

Downstream from and under the transcriptional 
initiation regulation of the regulatable initiation region 
will be a sequence of interest which will provide for 
modification of the phenotype of the specific plant tissue 
or part. Desirably, integration constructs may be prepared 
which allow for integration of the transcriptional cassette 
into the genome of a plant host. Conveniently, the vector 
may include a multiple cloning site downstream from the 
regulated transcriptional initiation region, so that the 
integration construct may be employed for a variety of 
sequences in an efficient manner. The DNA construct will 
also provide for a termination region, so as to provide an 
expression cassette into which a gene may be introduced. 
Conveniently, transcriptional initiation and termination 
regions may be provided separated in the direction of 
transcription by a linker or poly linker having one or a 
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plurality of restriction sites for insertion of the gene to 
be under the transcriptional regulation of the regulatory 
regions. Usually, the linker will have from 1 to 10, more 
usually from about 1 to 8 , preferably from about 2 to 6 
5 restriction sites. Generally, the linker will be fewer than 
100 bp, frequently fewer than 60 bp and generally at least 
about 5 bp. In conjunction with the subject method these 
constructs may be used for the introduction of the 
structural gene into plant cells in culture, where the cells 

10 may be regenerated into whole plants* 

The DNA constructs which are provided employ T-DNA 
flanking regions, flanking a structural gene including 
transcriptional and translational regulatory sequences* 
Thus, the construct which includes the structural gene, its 

15 transcriptional and translational regulatory controls, and 
the T-DNA flanking regions will for the most part have the 
following formula: 

(T') a - P - S.G. - Te - (T*) h 

wherein: 

2 0 T 1 and T 2 are the same or different and are T-DNA 

from a Ti- plasmid or an Ri- plasmid, where a and b and 0 or 
1, at least 1 of a and b being 1; 

P is a promoter region recognized by a plant host, 
which promoter region may include promoters derived from Ti- 
25 or Ri- plasmids, such as the octopine synthase or nopaline 
synthase promoters, viral promoters, plant promoters, 
particularly leguminous and monocotyledonous plant host 
promoters of various structural genes, e.g., RuBP- 
carboxylase, more particularly SSU. The promoter region 

3 0 will normally include a region for binding of RNA 

polymerase, as well as a cap site. In addition, there may 
be present enhancers, operators, activators, or other 
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regions involved with transcriptional regulation. The 
transcriptional initiation region may be native or 
homologous to the host or foreign or heterologous to the 
host. By foreign is intended that the transcriptional 
5 initiation region is not found in the wild-type host into 
which the transcriptional initiation region is introduced. 

S.G. intends a structural gene having an open 
reading frame and having at its 5 7 -end an initiation codon 
and at its 3 '-end one or more nonsense codons. The DNA 

10 sequence may have any open reading frame encoding a peptide 
of interest, e.g. an enzyme, or a sequence complementary to 
a genomic sequence, where the genomic sequence may be an 
open reading frame, an intron, a non-coding leader sequence, 
or any other sequence where the complementary sequence will 

15 inhibit transcription, messenger RNA processing, e.g. 

splicing, or translation. The DNA sequence of interest may 

9 

be synthetic, naturally derived, or combinations thereof. 
Depending upon the nature of the DNA sequence of interest, 
it may be desirable to synthesize the sequence with plant 
20 preferred codons. The plant preferred codons may be 

determined from the codons of highest frequency in the 
proteins expressed in the largest amount in the particular 
plant species of interest. 

Te intends a termination region functional in the 

2 5 plant host cell. The termination region, besides including 

at least one terminating sequence, may also include a polyA 
signal. The termination region which is employed will be 
primarily one of convenience, since the termination regions 
appear to be relatively interchangeable. The termination 

3 0 region may be native with the transcriptional initiation 

region, may be native with the DNA sequence of interest, or 
may be derived from another source. Convenient termination 
regions are available from the Ti-plasmid of A. tumef aciens . 



such as the octopine synthase and nopal ine synthase 
termination regions. 

Identifying useful regulated transcriptional 
initiation regions may be achieved in a number of ways* For 
5 example, where a fruit or seed protein has been or is 

isolated, it is partially sequenced, so that a probe can be 
designed for identifying messenger RNA specific for fruit or 
seed. To further enhance the concentration of the messenger 
RNA specifically associated with fruit or seed, cDNA can be 

10 prepared and the cDNA subtracted with messenger RNA or cDNA 
from non-seed or non-fruit associated cells. The residual 
cDNA can then be used for probing the genome for 
complementary sequences, using an appropriate library 
prepared from plant cells* Sequences which hybridize to the 

15 cDNA then can be isolated, manipulated, and the 5'- 

untranslated region associated with the coding region 

9 

isolated and used in expression constructs to identify the 
transcriptional activity of the $' -untranslated region. In 
some instances, a probe may be employed directly for 

2 0 screening a genomic library and identifying sequences which 

hybridize to the probe. The sequences will be manipulated 
as described above to identify the 5 ' -untranslated regions. 

As an example, a promoter of particular interest for 
the subject invention, the fruit-specific transcriptional 
25 initiation region (promoter) from a DNA sequence which 
encodes -a protein described as 2A11 in the experimental 
section was identified as follows* cDNA clones made from 
ripe fruit were screened using cDNA probes made from ripe 
fruit, green fruit, and leaf mRNA* Clones were selected 

3 0 having more intense hybridization with the fruit DNAs as 

contrasted with the leaf cDNAs . The screening was repeated 
to identify a particular cDNA referred to as 2A11. The 2A11 
cDNA was then used for screening RNA from root, stem, leaf, 



and seven stages of fruit development after the mRNA was 
sized on gels* The screening demonstrated that the 
particular message was present throughout the seven stages 
of fruit development- The mRNA complementary to the 
5 specific cDNA was absent in other tissues which were tested. 
The cDNA was then used for screening a genomic library and a 
fragment selected which hybridized to the subject cDNA. The 
5' and 3 'non-coding regions were isolated and manipulated 
for insertion of a foreign sequence to be transcribed under 

10 the regulation of the 2A11 promoter. 

The expression constructs which are prepared 
employing the regulated 5' -untranslated regions may be 
transformed into plant cells as described previously for 
evaluation of their ability to function with a heterologous 

15 structural gene (i.e., a gene other than the open reading 
frame associated with the 5 ' -untranslated region) and 
specificity of expression for example in a particular plant 
tissue or plant part such as leaves, seed or fruit. In this 
manner, specific sequences may be identified for use with 

20 sequences for fruit or seed-specific transcription and 
light-induced transcription. 

Several promoters are of particular interest. These 
include the soybean SSU promoter, promoters from genes 
encoding storage proteins and seed embryo genes and those 

25 from genes that are activated at or shortly after anthesis. 
The transcriptional initiation region may be native or 
homologous to the host or foreign or heterologous to the 
host. By foreign relative to a particular host is intended 
that the transcriptional initiation region is not found in 

3 0 the wild-type host into which the transcriptional initiation 
region is introduced. Other fruit-specific promoters may be 
activated at times subsequent to anthesis, such as prior to 
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or during the green fruit stage, during pre-ripe (e.g., 
breaker) or even into the red fruit stage. 

By use of the soybean SSU promoter, it is found that 
the expression of the gene under the SSU promoter can be 
5 light-induced. Thus, the expression of the gene is 
regulatable, where enhanced expression occurs during 
irradiation with light, while substantially reduced 
expression or no expression occurs in the absence of light. 
The nucleotide sequence of the small subunit gene is 

10 described by Berry-Lowe, J" ♦ Mol . Appl. Gen. (1982) 1:483- 
498. A Ddel digest of a plasmid containing a genomic 
fragment which includes the SSU soybean gene yields a l.lkd 
5' piece that can be used as a promoter fragment. 

Transcriptional initiation regions from genes 

15 encoding storage proteins, such as those from genes encoding 
napin, cruciferin, 0-conglycinin, phaseolin, or the like, 
and proteins involved in fatty acid biosynthesis, such as 
acyl carrier protein (ACP) are also of interest. The 
transcriptional initiation regions may be obtained from any 

20 convenient host, particularly plant hosts such as Brassiest, 
e.g. napus or campestris , soybean (Glycine max), bean 
(Phaseolus vulgaris), corn (Zea mays), cotton (Gossypium 
sp.) , saf flower (Carthamus tinctorius) , tomato (Ly copers icon 
esculentum) , and Cuphea species. 

25 Other transcriptional initiation regions of 

particular interest are those associated with seed embryo 
genes that are expressed in the period from about day 7 to 
day 40, particularly those having maximum expression in the 
period from about day 10 to about day 30, postanthesis, and 

3 0 seed coat genes which are expressed in the period from about 
day 11 to day 30. Usually the period of expression will be 
at least 3 days, more usually about 7 days and may be 
substantially over the entire period. 



Also of interest is a transcriptional initiation 
region which is activated at or shortly after anthesis, so 
that in the early development of the fruit, it provides the 
desired level of transcription of the sequence of interest. 
5 Normally, the sequence of interest will be involved in 

affecting the process in the early formation of the fruit or 
providing a property which is desirable during the growing 
(expansion) period of the fruit, or at or after harvesting. 

The ripening stages of the tomato may be broken down 

10 into mature green, breaker, turning, pink, light red and 
red* Desirably, the transcriptional initiation region 
maintains its activity during the expansion and maturation 
of the green fruit, more desirably continues active through 
the ripening or red fruit period. Comparable periods for 

15 other fruit are referred to as stages of ripening. The 

invention is not limited to those transcriptional initiation 
regions which are activated at or shortly after anthesis but 
also includes transcriptional initiation regions which are 
activated at any of the ripening stages of the fruit. An 

2 0 example of a fruit-specific transcriptional initiation 

region is the one referred to as 2A11 which regulates the 
expression of a 2A11 cDNA sequence described in the 
Experimental section. The 2A11 transcriptional initiation 
region provides for an abundant messenger, being activated 
25 at or shortly after anthesis and remaining active until the 
red fruit stage. The expressed protein is a sulfur-rich 
protein similar to other plant storage proteins in sulfur 
content and size. 

Also of interest is a transcriptional initiation 

3 0 region which regulates expression of the enzyme 

polygalacturonase, an enzyme which plays an important role 
in fruit softening and/ or rotting. The polygalacturonase 
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promoter is active in at least the breaker through red fruit 
stage in tomato fruit. 

Any structural gene of interest may be employed for 
use in the construct. In many instances, it will be 
5 desirable to have another structural gene to serve as a 

marker associated with the construct, so that one can detect 
those plant cells in which the foreign gene has been stably 
introduced. For the most part, these constructs will have 
the following formula: 
10 (T ! ) a - P ! -(S.G.) 1 - Te 1 ) - (PMS.GO 2 - Te J ) - (T 2 ) b 

wherein: 

all of the symbols have the same functional 
definition except that the superscripts for P and Te intend 
that the promoter and terminator regions may be the same or 

15 different, where one is a marker and the other is a 

structural gene of interest. Of course, one may provide for 
a string of expression constructs having a plurality of the 
same or different genes in the construct. Thus, the 
presence of only two genes flanked by the T-DNA is merely 

20 i 1 lustrat i ve « 

As markers for structural genes, one can employ 
antibiotic resistance genes, e.g., a kanamycin resistance 
gene or methotrexate resistance gene (DHFR) . These genes 
are described in Haas and Dowding, supra* Other markers 

25 include resistance to a biocide, particularly an antibiotic, 
such as-G418, bleomycin, hygromycin, chloramphenicol, or the 
like. The particular market employed will be one which will 
allow for selection of transformed cells as compared to 
cells lacking the DNA which has been introduced. 

3 0 The structural gene of interest may be any gene, 

either native, mutant native, or foreign to the plant host, 
and may be provided in a sense or antisense orientation. 
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For native and mutant genes, the gene may provide for 
increased capability of protein storage/ improved nutrient 
source, enhanced response to light, enhanced dehydration 
resistance, e.g., to heat, salinity or osmotic pressure, 
herbicide resistance, e.g., glyphosate, or the like. 
Foreign genes may include enhancement of native 
capabilities, herbicide resistance, resistance to various 
pests, such as viruses, insects, bacteria or fungi, 
production of foreign products, as a result of expression of 
one or more foreign genes, or the like. 

In preparing the cassette construct, the various DNA 
sequences may normally be inserted or substituted into a 
bacterial plasmid. Any convenient plasmid may be employed, 
which will be characterized by having a bacterial 
replication system, a marker which allows for selection in 
the bacterium and generally one or more unique, conveniently 
located restriction sites. These plasmids, referred to as 
vestors, may include such vestors as pACYC184, pACYC177, 
pBR322, pUC9, the particular plasmid being chosen based on 
the nature of the markers, the availability of convenient 
restriction sites, copy number, and the like. Thus, the 
sequence may be inserted into the vector at an appropriate 
restriction site(s), the resulting plasmid used to transform 
the E. coll host, the E. coll grown in an appropriate 
nutrient medium and the cells harvested and lysed and the 
plasmid recovered. One then defines a strategy which allows 
for the stepwise combination of the different fragments* 

As necessary, the fragments may be modified by 
employing synthetic adapters, adding linkers, employing in 
vitro mutagenesis or primer repair to introduce specific 
changes in the sequence, which may allow for the 
introduction of a desired restriction site, for removing 
superfluous base pairs, or the like. By appropriate 
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strategies, one desires to minimize the number of 
manipulations required as well as the degree of selection 
required at each stage of manipulation. After each 
manipulation, the vector containing the manipulated DNA may 
5 be cloned, the clones containing the desired sequence 
isolated, and the vector isolated and purified* As 
appropriate, hybridization, restriction mapping or 
sequencing may be employed at each stage to ensure the 
integrity and correctness of the sequence, 

10 The cassette constructs may be introduced into the 

plant host cell in a variety of ways, such as an insertion 
into a tumor- or gall-producing plasmid, as bare DNA, as an 
insertion in a plant DNA virus such as A. tumefaciens or A* 
rhizogenes as the transforming agent, protoplast fusion, 

15 injection, electroporation, etc- For transformation with 
Agrobacterium , plasmids can be prepared in E . coli which 
plasmids contain DNA homologous with the Ti-plasmid, 
particularly T-DNA. The plasmid may or may not be capable 
of replication in Agrobacterium, that is, it may or may not 

20 have a broad spectrum prokaryotic replication system, e.g., 
RK2 90, depending in part upon whether the transcription 
construct is to be integrated into the Ti-plasmid or be 
retained on an independent plasmid. By means of a helper 
plasmid, the transcription construct may be transferred to 

25 the A. tumefaciens and the resulting transformed organism 
used for ^transforming plant cells. 

Alternatively, to enhance integration into the plant 
genome, terminal repeats of transposons may be used as 
borders in conjunction with a transposase. In this 

3 0 situation, expression of the transposase should be 

inducible, so that once the transcription construct is 
integrated into the genome, it should be relatively stably 
integrated and avoid hopping. 



Conveniently, explants may be cultivated with A. 
tumefaciens or A. rhizogenes to allow for transfer of the 
expression cassette to the plant cells, the plant cells 
dispersed in an appropriate selective medium for selection, 
5 grown to callus, shoots grown and plantlets regenerated from 
the shoots by growing in rooting medium. The Agrobacterium 
host will contain a plasmid having the vir genes necessary 
for transfer of the T-DNA to the plant cells and may or may 
not have T-DNA. For injection and electroporation, disarmed 

10 Ti-plasmids (lacking the tumor genes, particularly the T-DNA 
region) may be used to introduce genes into the plant cell* 

In accordance with the subject invention, an 
efficient procedure is provided for introduction of foreign 
DNA into plant cells with integration of the DNA and without 

15 gall formation, particularly as to plants which previously 
have been reported to be outside the host range of 
Agrobacterium. For a list of plant genera and species which 
are hosts and non-hosts for Agrobacterium, see De Cleene and 
Le Ley, The Botanical Review (1976) 42:389-466, Of 

2 0 particular interest in the subject invention are dicotyledon 

legumes, such as soybean, and monocotyledon grains, such as 
corn, rice, wheat, barley and oats- 

Where a tumor- or gall -producing plasmid, e.g., the 
Ri-or Ti-plasmid, is to be used to introduce the cassette 
25 into the plant cell, a binary plasmid, which includes an 
Agrobacterium functional replication system, or bacterial 
mating may be employed, whereby the cassette-carrying 
plasmid is transferred from a compatible bacterium to A. 
rhizogenes or A. tumefaciens and the transconjugant isolated 

3 0 and analyzed for integration of the cassette into the Ri- or 

Ti-plasmid* This can be readily determined by various 
techniques, such as Southern analysis. 
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The Ti- or Ri-plasmid which is employed should be 
capable of providing for integration of T-DNA in the host 
without observable symptoms of tumor or gall formation* 
Thus, the plasmid which is selected may be tumor-producing 
in a conventional host, but will not produce tumors in 
plants normally considered not to be hosts • An illustrative 
plasmid is pTiA6 , a wild-type plasmid. The A. rhizogenes or 
A. tumefaciens bacteria containing the cassette and the Ri- 
or Ti-plasmid may now be used for transformation of a plant 
host cell. 

For transformation particularly of monocatoledenous 
or leguminous plants, the subject method employs in vitro 
grown seedlings between green V-E and V-l (Fehr and 
Caviness, 1977, Stages of Soybean Development, Iowa State 
Coop. Ext. Serv. , Agric. and Home Econ. Expt. Stn. Special 
Report 80) . Thus, young plants, the hypocotyl or next leaf 

9 

are employed. The Agrobacterium cells are injected into the 
plant tissue. Generally about l-5jil of lxlO 6 to 1x10, 
cells/ml will be injected. Injection of Agrobacterium into 
cotyledons, nodes and internodes causes a- visible necrosis 
around the wound site. No tumor formation is observed. 
After about one to three weeks, the explants are excised 
from the tissue surrounding the site of injection and 
subcultured in a hormone lacking medium. Callus is observed 
to grow from some of the explants. Opine is present in 
these tissues, while none is detected in non-transformed 
callus. 

Transformation of seed crops such as Brassica can be 
by any of a variety of methods known to those skilled in the 
art. See , for example, Radke et al . (1988) Theor. Appl. 
Genet. 75:685-694 and Radke et al . (1992) Plant Cell Reports 
11:499-505. 



The cells which have been transformed may be grown 
into plants in accordance with conventional ways. See, for 
example, HcCormick et al., Plant Cell Reports (1986) 5:81- 
84, These plants may then be grown, and either pollinated 
5 with the same transformed strain or different strains, and 
the resulting hybrid having the desired phenotypic 
characteristic identified. Two or more generations may be 
grown to ensure that the subject phenotypic characteristic 
is stably maintained and inherited and then fruits or seeds 

10 harvested to ensure the desired phenotype or other property 
has been achieved. 

As a host cell, any plant variety may be employed 
which provides a plant part or tissue of interest. For 
example, where the plant tissue of interest is seed, for the 

15 most part, plants will be chosen where the seed is produced 
in high amounts or a see<3~specif ic product of interest is 
involved. Seeds of interest include the oil seeds, such as 
the Brassica seeds, cotton seeds, soybean, saf flower, 
sunflower, or the like; grain seeds, e.g., wheat, barley, 

2 0 rice, clover, corn, or the like. 

Where the plant part is a fruit, any of a number of 
fruit bearing plants may be employed in which the plant 
parts of interest are derived from the ovary wall. These 
include true berries such as tomato, grape, blueberry, 
25 cranberry, currant, and eggplant; stone fruits (drupes) such 
as cherry, plum, apricot, peach, nectarine and avocado; 
compound fruits (droplets) such as raspberry and blackberry. 
In hesperidium (oranges, citrus) , the expression cassette 
might be expected to be expressed in the "juicy" portion of 

3 0 the fruit. In pepos (such as watermelon, cantaloupe, 

honeydew, cucumber and squash) the equivalent tissue for 
expression is most likely the inner edible portions, whereas 
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in legumes (such as peas, green beans , soybeans) the 
equivalent tissue is the seed pod. 

By use of transcription initiation regions from 
regulated genes, it is found that expression of a structural 
5 gene of interest, either sense or antisense, can be 

regulated in a manner similar to the regulation of the gene 
native to the transcription initiation region. For example, 
by use the soybean SSU promoter, the expression of a gene 
under the control of this promoter is induced by light. 

10 Thus, the expression of the gene is regulatable, where 

enhanced expression occurs during irradiation with light, 
while substantially reduced expression or no expression 
occurs in the absence of light. Similarly, transcription 
initiation regions from genes expressed preferentially in 

15 seed or fruit tissues may be used to control of expression 
of desired DNA sequences JLn these plant tissues. 

By virtue of having a regulatable promoter in the 
soybean plant, one can provide for protection against 
herbicides, by providing a herbicide-resistant gene to be 

20 under the regulatable control of the SSU promoter. For 
example, by employing a mutated aroA gene, the enzyme 5- 
enolpyruvyl-3-phosphoshikimate synthase which is glyphosate- 
resistant can be produced under light induction. Thus, the 
soybean plant may be protected from glyphosate, allowing for 

25 the killing of weeds employing the glyphosate herbicide. 
While glyphosate may be used by itself, particularly for 
pre-emergent spraying and post-emergent control of weeds, 
the glyphosate may also be used with other post-emergent 
broadleaf herbicides, such as Basagran (bentazan) , 

30 Tackle/Blazer (acif lurof en) . Normally, applications will 
vary from about 1.25 to 1.5 lbs/acre, where the herbicides 
may be formulated as dry or wet formulations, by themselves 
or in combination with other additives, such as sticking 
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agents, spreading agents, stabilizers, or the like. Inert 
powders may be used with dry formulations. 

A transcriptional initiation region may be used for 
varying the phenotype of the seeds. Various changes in 
phenotype are of interest. These include modifying the 
fatty acid composition in seeds, that is changing the ratio 
and/ or amounts of the various fatty acids, as to length, 
unsaturation, or the like* Thus, the fatty acid composition 
may be varied by enhancing the fatty acids of from 10 to 14 
carbon atoms as compared to the fatty acids of from 16 to IS 
carbon atoms, increasing or decreasing fatty acids of from 
20 to 24 carbon atoms, providing for an enhanced proportion 
of fatty acids which are saturated or unsaturated, or the 
like. These results can be achieved by providing for 
reduction of expression of one or more endogenous products, 
particularly enzymes or* cof actors , by producing a 
transcription product which is complementary to the 
transcription product of a native gene, so as to inhibit the 
maturation and/ or expression of the transcription product, 
or providing for expression of a gene, either endogenous or 
exogenous, associated with fatty acid synthesis. Expression 
products associated with fatty acid synthesis include acyl 
carrier protein, acyl-ACP thioesterase, acetyl-CoA ACP 
transacylase, acetyl-CoA carboxylase, ketoacyl-ACP 
synthases, malonyl-CoA ACP transacylase, stearoyl-ACP 
desaturase, and other desaturase enzymes. 

A transcriptional initiation region may be employed 
for varying the phenotype of the fruit. Various changes in 
phenotype are of interest. These changes may include up- or 
down-regulation of formation of a particular saccharide, 
involving mono- or polysaccharides, involving such enzymes 
as polygalacturonase, levansucrase, dextransucrase , 
invertase, etc. ; enhance lycopene biosynthesis; cytokinin 
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and monellin synthesis. Other properties of interest for 
modification include response to stress, organisms, 
herbicides, bruising, mechanical agitation, etc., change in 
growth regulators, organoleptic properties, etc. For 
5 antisense or complementary sequence transcription, the 

sequence will usually be at least 12, more usually at least 
16 nt. Antisense sequences of interest include those of 
polygalacturonase, sucrase synthase and invertase. 

Alternatively, one may provide various products from 

10 other sources including mammals, such as blood factors, 
lymphokines, colony stimulating factors, interferons, 
plasminogen activators, enzymes, e.g. superoxide dismutase, 
chymosin, etc., hormones, rat mammary thioesterase 2, 
phospholipid acyl desaturases involved in the synthesis of 

15 eicosapentaenoic acid, and human serum albumin. The level 

of seed proteins, particularly mutated seed proteins, having 
an improved amino acid distribution which would be better 
suited to the nutrient value of the seed can also be 
increased. This can be achieved, for example, by inhibition 

20 of the native seed protein by producing a complementary DNA 
sequence to the native coding region or non-coding region, 
where the complementary sequence does not hybridize 
efficiently to the mutated sequence, or inactivates the 
native transcriptional capability. 

25 A protein is provided having the sequence described 

in the Experimental section designated as 2 All. This 
protein could be a storage protein and be useful in 
enhancing sulfur containing amino acids (cysteine and 
methionine) in the diet. It can be obtained in 

30 substantially pure form by providing for expression in 

prokaryotes or eukaryotes , e.g., yeast by inserting the open 
reading frame into an expression cassette containing a 
transcriptional initiation region, A variety of expression 



cassettes are commercially available or have been described 
in the literature* See/ for example, U.S. Patent Nos. 
4,532,207; 4,546,082; 4,551,433; and 4,559,302. The 
product, if intracellular, may be isolated by lysing of the 
5 cells and purification of the protein using electrophoresis, 
affinity chromatography, HPLC extraction, or the like. The 
product may be isolated in substantially pure form free of 
other plant products, generally having at least about 95% 
purity, usually at least about 99% purity. 
10 The following examples are offered by way of 

illustration and not by limitation. 

The following examples are offered by way of 
illustration and not by way of limitation. 

EXAMPLES 

15 Cloning Vectors 

Cloning vectors used include the pUC vectors, pUC8 
and pUC9 (Vieira and Messing, Gene (1982) 19:259-268); pUC18 
and pUC19 (Norrander et al., Gene (1983) -26:101-106; 
Yanisch-Perron et al., Gene (1985) 33: 103-119), and 

2 0 analogous vectors exchanging chloramphenicol resistance 

(CAM) as a marker for the ampicillin resistance of the pUC 
plasmids described above (pUC-CAM [pUC12-Cm, pUC13-Cm] 
Buckley, K. , Ph.D. Thesis, U.C.S.D., CA 1985). The 
multiple cloning sites of pUC18 and pUC19 vectors were 
25 exchanged with those of pUC-CAM to create pCGN565 and 

pCGN566 which are CAM resistant. Also used were pUC118 and 
pUC119, which are respectively, pUC18 and pUC19 with the 
intergenic region of M13 , from an Egi&X site at 5465 to the 
Aballl site at 5941, inserted at the Ndel site of pUC 

3 0 (available from Vieira J. and Messing, J. Waksman Institute, 

Rutgers Univers ity , Rutgers , N.J.) 



Materials 

Terminal deoxynucleotide transferase (TDT) , RNaseH, 
B. coli DNA polymerase, T4 kinase , and restriction enzymes 
were obtained from Bethesda Research Laboratories; B. coli 
5 DNA ligase was obtained from New England Biolabs; reverse 
transcriptase was obtained from Life Sciences, Inc. ; 
isotopes were obtained from Amersham; X-gal was obtained 
from Bachem, Inc. , Torrance, CA. 

Bacterial strains, olasmids, and media 

10 E. coli strains MK294(F endAl hsdRll supE44 thi"l) 

(Meselson and Yuan, Nature (1968) 217:1110-1114) and 71-18 
(Alac-proAB) supE thi T'lacX* 2 M15 proA + B*) (Messing et 
al., Proc. Natl. Acad. Sci . USA (1977) 74:3642-3646) were 
routinely used for transformations. A. tumefaciens A3 4 8 

15 contains the octopine Ti-plasmid pTiA6 in A114 (Garfinkel 

and Nester, J. Bacterid. (1980) 144:732-743). pRK2073 was 
maintained in HB101(F hs<?20 (r B -r m -) recA13 proA2 lacYl 
ieuB6 rpsL20 thi 1 supE44) (Boyer and Rouland-Dussiox, J. 
Hoi. Biol. (1969) 41:459). 

2 0 Plasmid pRK2073 was generated by insertion of Tn7 

into the Kan r gene of pRK2013. (Ditta et al . , Proc. Natl. 
Acad. Sci. USA (1980) 76:1648-1652) pSR2 . 1 (Berry-Lowe et 
al., 1982; supra.) contains a 2.1 EcoRI fragment of a 
soybean small subunit gene (SSU) in pBR3 25. The Baml9 
25 fragment" of pTiA6 was maintained as a 4 . 6kb subclone in 

PBR325 (pNM33C-19-l) (Thomashow et al . , Cell (1980) 15:729- 
739). pCGN464 contained the 1 . 5kb ffindlll-Sall fragment of 
Tn5 cloned into the sp6 transcription vector pSPSS (Melton 
et al., Nucl. Acids Res. (1984) 12:7035-7056). The pUC7 

3 0 recombinant vector containing the l.Okb BgllX-SmaX fragment 

of Tn5(pCGN546) is designated pCGN546. 
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E. coli were grown on LB media (Miller, 1972, 
Experiments in Molecular Genetics, CSH Laboratory, Cold 
Spring Harbor, NY) . A. tumefaciens were grown in either 
minimal KB medium (Chilton et al., Proc. Natl. Acad. Sci. 
5 USA (1974) 71:3672-3676) or in MG/L (50% LB:50% 

mannitolglutamate medium (Roberts and Kerr, Physiol. Plant 
Pathol. (1974) 4:81-91. 

E. coli strain pCGN1299x7118 was deposited with the 
American Type Culture Collection (A.T.C.C.), 12301 Parklawn 
10 Drive, Rockville, Maryland, 20852 on May 21, 1987 and given 
Accession No. 67408. 

Example 1 

Preparation of Transformed Soybe an Plants 

Soybean (glycine max cv "forrest") seeds were 

15 surface sterilized (12min, 5% sodium hypochlorite, 0.1% 

Tween 80) , washed 3 times in distilled water and germinated 
aseptically (1/10 MS-Gibco, 0*6% phytagar (Gibco) medium 
without hormones, 25 *C red light (Grolux 40W).). 
Agrobacterium containing strains pTiCGN327 and pTiCGN609 

20 were grown overnight (MG/L medium 30* C) were injected into 
hypocotyl, cotyledons, node and internode of two to three 
week old seedlings. Three weeks after injection, tissues 
surrounding the injection site were excised and placed on 
0.6% phytagar MS medium deprived of hormones and containing 

25 0.5g/L carbenicillin. Hormone independent, octopine 

positive tissues were then transferred to liquid MS medium 
and analyzed for the presence of octopine (Otten and 
Schilperoot, Biochem. et Biophys. Acta 1978) 527:497-500). 
To determine kanamycin resistance, growing calli were then 

30 placed in light or complete darkness. Friable calli of 

light grown or dark grown 3 27 and 609 were disaggregated by 



filtering through a 105/1 nylon mesh. Samples (o.lml packed 
cell volume (p.c.v.)) of fine suspensions {1-15 cells/clump 
were placed in the same medium containing 0 to 300mg/L 
kanamycin. Pigmented cells were kept in the light while the 
5 non-pigmented cells were kept in total darkness . The 

effects of kanamycin on growth were measured as packed cell 
volume six weeks later* 

DNA Isolation 

The alkali-lysis procedure of Ish-Horowitz (Maniatis 
10 et al., 21982 Molecular Cloning, A Laboratory Manual, CSH 

Laboratory, Cold Spring Harbor, NY) was used for both large- 
scale plasmid isolation and for mini-prep analysis. Total 
DNA from A* tumefaciens was prepared as described (Currier 
and Nester, J. Bacterid. (1976) 125:157-165. 
15 DNA fragments wez*e isolated from low melt agarose 

gels (Sea Plaque) run in TAE buffer (0.04M Tris-acetate, 
0.002M EDTA (Maniatis , supra.) without ethidium bromide. 
The desired fragment was extracted from the excised agarose 
band by melting at 65oC for 30min followed by phenol 
20 extraction and ethanol precipitation. 

Cloning Procedures 

Restriction enzyme digestions and ligations were 
performed according to manufacturer's instructions. Klenow 
fill-in reactions and transformation were as described 

25 (Maniatis, supra.) When pUC9 was being transformed into J?. 
coli strain 71-18, X-Gal and IPTG were added to the plates 
as described (Miller, 1972, supra). Correct insertion and 
orientation of recombinants were verified by 2 to 3 
restriction digests. 

30 The verification of the Smal-Ddel junction in 

pCGN606 was done by cloning the l.lkb BamRi-EcoRI fragment 



into M13mp9 (Maniatis, suprst). Sequence analysis was then 
performed in accordance with conventional ways. 

Aorobacterium matinas 

The pCGN609 construct was integrated into the 
5 Tiplasmid pTiA6 in a three-way mating (Comai et al., 1983, 
supra) . Overnight E. Coli strains containing cultures of 
pCGN609 and pRK2073, respectively, were mixed with A. 
tujnef aciens strain A722 and spread on AB plates containing 
150;tg/ml kanamycin and 250/ig/ml streptomycin. Single 

10 colonies were restreaked twice. Correct integration was 
verified by Southern analysis of total Agrobacterium DNA. 
BamHI digested DNA was probed with a nick-translated 2.5 
PstI-£coRI 3' ocs fragment from pCGN607. Southern analysis 
and nick translation were performed in accordance with 

15 conventional ways. 

RNA preparation and Northern blo t analysis 

RNA was prepared from soybean callus by a 
modification of the guanidine thiocyanate -procedure of 
Colbert et al . (Proc. Natl. Acad. Sex. USA (1983) 80:2248- 

20 2252) in which the extraction buffer contain 4M guanidine 

thiocyanate, 2% lauryl sarcosine, 1% £-mercaptoethanol, 50mM 
Tris, pH 7.5 20mM EDTA, ImM aurintricarboxylic acid, 0.4% 
antifoam A (Sigma) . PolyA* RNA was purified over oligo-dT 
cellulose (Maniatis, supra.) and Northern gels run as 

25 previously described (Shewmaker et al . , 1984, supra). 32 p- 
RNA bacterial amioglycoside phosphotransferase mRNA 
complementary to (APH (3 ' ) 1 1 -mRNA) (Herrera-Estrella et al., 
EMBO J. (1983) 2:987-995; Bevan et al., Nature (1983) 
304:184-187) was synthesized from BglZl cut pCGN464 using a 

3 0 riboprobe kit (ProMega Biotech) according to the 
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manufacturer's instructions. The hybridization buffers were 
as suggested by the riboprobe manufacturer's with 
hybridization at 55 *C and washes at 60 *C. 

Kanamycin activity blots 
5 The lanaiaycin activity blots (Reiss et al., Gene 

(1984) 30:211) were performed as modified for plants 

(Schreier et al., EMBO J. (1984) : ). For each 

sample, 0.2g of fresh soybean callus was used. 

Construction of soybean ssu-Kan f chimera 

10 A soybean SSU gene (Berry-Lowe), 1982, supra) was 

chosen as the source of the 5' -promoter region. In this 
gene there is a Ddel site, 9pb upstream of the AUG. A Ddel 
digest of pSRS2.l (Berry-Lowe, 1982, supra.) yielded a l.lkd 
5' fragment isolated outr of a low melt agarose gel. The 5' 

15 l.lkb Ddel fragment was filled in with Klenow polymerase and 
ligated into Smal digested pUC9 (Vieira and Messing, Gene 
(1982) 19:259). A clone, pCGN606 was obtained that had the 
SSU promoter facing the adjacent EcoKZ site of pUC9. 

A cassette containing the soybean 5' region and an 

20 appropriate 3' region was then constructed. For this 

cassette, the octopine synthetase (ocs) 3' region was chosen 
as a 2.5kb EcoRI-PstI fragment from a Baml9 subclone of 
pTiA6 (Thomashow, 1980, supra). Since it contained regions 
homologous to T-DNA, it would facilitate transfer to the 

25 Tiplasmid of AgroJbacterium . The cassette pCGN607 was 

obtained in a 3-way ligation with this fragment, the l.lkb 
BamKl-EcoRI 5' soybean SSU fragment from pCGN606, and the 
2.7kb BamHI-Pstl fragment of pACYC177 (Chang and Cohen, J. 
Bacteriol. (1978) 134:1141). 
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The APH(3')-II gene employed was from Tn5, which 
confers resistance to kanamycin both in bacteria (Haas and 
Dowding, tfeth. Enzymology (1975) 43:611-628) and plants 
(Herrera-Estrella, 1983, supra)- A l.Okb Bgl II -Smal 
5 fragment containing the gene was cloned into pUC7 resulting 
in adjacent flanking EcoKl restriction sites. The plasmid 
was digested to provide a l.Okb EcoRl fragment and this 
fragment ligated into JScoRI digested pCGN607. Clones were 
screened for those carrying the Kan f gene of Tn5 in the 

10 correction orientation. One of the clones which had the 
correct orientation was designated pCGN609* The plasmid 
also carried the kanamycin resistance gene from pACYC177 
(APH(3')-I as a bacterial marker* These two kanamycin 
resistance genes (APH(3')-I and -II) do not cross-hybridize 

15 at the nucleic acid level. 

Following Klenow -polymerase fill-in, only 9bp which 
are present upstream of the AUG in native soybean SSU are 
lacking in pCGN609. These 9bp are replaced with 46bp that 
arise from the fusion manipulations. The rest of the l*lkb 

20 soybean SSU 5 'region is the same in pCGN609 as in native 
soybean. 

The integration of pCGN609 into the Ti-plasmid pTiA6 
was accomplished in a three-way (Comai et al. t 1983) mating 
with pRK2073. Correct integration was verified by Southern 

25 analysis of the resulting Agrobacterium, designated 

pTiCGN6=09. In the integration an intact octopine synthetase 
region is maintained as evidenced by the detection of 
octopine. octopine was detected by fluorescence of its 
phenanthroguinone adduct following paper electrophoresis of 

30 tissue extracts (lOmg) • 
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Transfo rmation of soybean 

Transformation of soybean was performed on in vitro 
grown seedlings from the time their cotyledons turned green 
up to the time of the appearance of the second internode. 
In every case, the injection of Agrobacterium caused a 
clearly visible necrosis around the wound site. 
Occasionally, after 1 to 3 weeks, roots would appear, at the 
inoculation site. Splitting also occurred, revealing 
swollen tissue, but in no case was tumor noted with the 
Agrobacterium strains used* Explants excised from the 
tissue surrounding the site of injection were subcultured in 
MS medium deprived of hormones, 0*6% phytagar, 0.5g/L 
carbenicillin. Hormone- independent callus grew from some of 
the explants. Hormone- independent growing tissue for the 
presence of octopine was positive, while no octopine was 
detected in non~transf ofmed soybean tissue. All aerial 
parts of the soybean seedlings, cotyledons, internodes, and 
nodes, were able to produce txansf ormed tissue although no 
systematic study was done to determine which of these areas 
is most susceptible to Agrobacterium. 

Analysis of oolvA* RNA in light and dark grown tissue 

The increase in SSU protein seen in a number of 
light grown plants was shown to correlate with an increase 
in the level of SSU polyA* RNA. Northern analysis of light 
and dairX grown 609 soybean callus was* performed to determine 
if an increase in APH(3')-II polyA + RNA occurred with growth 
in light. The results were determined with a 32 P-RNA probe 
specific for APH(3')-II transcript in the sense orientation. 
An RNA of the expected size of approximately l.6kb was seen 
in both cases of light and dark. Approximately 5-10 times 



as much transcript was seen in the light grown tissue as the 
dark grown tissue. 

Presence of protein with kanamycin phosphotransferase 
activity 

5 APH (3 ' ) -II (aminoglycoside phosphotransferase) 

inactivates kanamycin by phosphorylation. The presence of 
this activity can be demonstrated by a number of assays 
which measure the phosphorylation of kanamycin in vitro. In 
the assay employed (Eeiss et al«, Gezte (1984) 30:211) 

10 extracts are run on an acrylamide gel f reacted in situ with 
kanamycin and ?- 32 P-ATP and then blotted to P81 
(phosphocellulose) paper- For green (light grown) and white 
(dark grown) 609 soybean callus, activity was seen in the 
green soybean at the same mobility as that observed for 

15 purified APH(3')-II, while no detectable activity was seen 
in white 609 tissue or in soybean transformed with an 
Agrobacterium lacking the APH (3 ' ) -II gene. 

Demonstration of kanamycin resistance in the transformed 
tissue 

2 0 Greening of the soybean callus occurred 

spontaneously after exposure to light. Some of the green 
609 callus selected for its friability was disaggregated as 
described previously and used to analyze its resistance to 
kanamycin. It was compared to similar non-pigmented tissue 

25 grown in complete darkness* Dark grown 609 as well as 

control 327 tissue died in the presence of 50mg/L kanamycin , 
while the light grown tissue could survive up to 300mg/L 
kanamycin although its growth was slightly inhibited at this 
concentration . 
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Example 2 
Construction of a Nap in Promoter 

There are 2 98 nucleotides upstream of the ATG start 
codon of the napin gene on the pgNl clone, a 3,3 kb ZcoRI 
fragment of B. napus genomic DNA containing a napin gene 
cloned into pUC8 (available from Marti Crouch, University of 
Indiana) . pgNl DNA was digested with EcoRl and SstI and 
ligated to EcoRI/Sstl digested pCGN706. (pCGN706 is an 
Xhol/PstI fragment containing 3' and polyadenylation 
sequences of another napin cDNA clone pN2 (Crouch et al • , 
1983 supra) cloned in pCGN566 at the Sail and PstI sites ♦) 
The resulting clone pCGN707 was digested with Sail and 
treated with the enzyme Bal 31 to remove some of the coding 
region of the napin gene. The resulting resected DNA was 
digested with Smal after the Ba231 treatment and religated. 
One of the clones, pCGN713 , selected by size, was subcloned 
by EcoRI and SaraHI digestion into both BcoRI-BamHI digested 
pEMBL18 (Dente et al . , Nucleic Acids Res. (1983) 22:1645- 
1655) and pUCllS to give E41S and E4118 respectively. The 
extent of Bal 31 digestion was confirmed by Sanger dideoxy 
sequencing of E418 template. The Bal 31 deletion of the 
promoter region extended only to 57 nucleotides downstream 
of the start codon, thus containing the 5' end of the napin 
coding sequence and about 3 00 bp of the 5' non-coding 
region. E4118 was tailored to delete all of the coding 
region of napin including the ATG start codon by in vitro 
mutagenesis by the method of Zoller and Smith (Nucleic Acids 
Res. (1982) 20:6487-6500) using an oligonucleotide primer 
5 ' -GATGTTTTGTATGTGGGCCCCTAGGAGATC-3 ' . Screening for the 
appropriate mutant was done by two transformations into E. 
coli strain JM83 (Messing J., In: Recombinant DNA Technical 
Bulletin, NIH Publication No. 79-99, 2 No. 2, 1979, pp 4 3- 
48) and Smal digestion of putative transf ormants. The 



resulting napin promoter clone is pCGN778 and contains 298 
nucleotides from the ScoRI site of pgNl to the A nucleotide 
just before the ATG start codon of napin. The promoter 
region was subcloned into a chloramphenicol resistant 
5 background by digestion with ScoRI and BamHI and ligation to 
.EcoRI-BamHI digested pCGN565 to give pCGN779c. 

Extension of the Napin Pr omoter Clone 

pCGN779c contains only 298 nucleotides of potential 
5 ' -regulatory sequence. The napin promoter was extended 

10 with a 1.8 kb fragment found upstream of the 5 '-.EcoRI site 
on the original XBnNa clone. The -3.5 kb Xhol fragment of 
XBnNa (available from M. crouch) , which includes the napin 
region, was subcloned into Sal I -digested pUC119 to give 
pCGN930. A ElndZIX site close to a 5' Xhol site was used to 

15 subclone the Hlndlll-EcoRl fragment of pCGN930 into HindXlI- 
EcoRX digested Bluescript + (Vector Cloning Systems, San 
Diego, CA) to give pCGN942. An extended napin promoter was 
made by ligating pCGN779c digested with £coRI and PstI and 
pCGN942 digested with £coRI and PstI to make .pCGN943 . This 

2 0 promoter contains -2.1 kb of sequence upstream of the 
original ATG of the napin gene contained on XBnNa. A 
partial sequence of the promoter region is shown in 
Figure 1 . 



Napin Cassettes 

25 The extended napin promoter and a napin 3'- 

regulatory region are combined to make a napin cassette for 
expressing genes seed-specifically. The napin 3 '-region 
used is from the plasmid pCGN1924 containing the Xhol-EcoRI 
fragment from pgNl (Xhol site is located 18 nucleotides from 

30 the stop codon of the napin gene) subcloned into .EcoRi-Sall 
digested pCGN565. Hindlll-Pstl digested pCGN943 and 
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PCGN1924 are ligated to make the napin cassette pCGN944, 
with unique cloning sites Smal, Sail, and PstI for inserting 
genes 

Construction of cDNA Library from Spinach Leaves 
5 Total RNA was extracted from young spinach leaves in 

4M guanidine thiocyanate buffer as described by Pacciotti et 
al. (Biotechnology (1985) 3:241-246). Total RNA was 
subjected to oligo(dT) -cellulose column chromatography two 
times to yield poly (A) * RNA as described by Maniatis et al., 

10 (1982) Molecular Cloning: A Laboratory Manual, Cold Spring 

Harbor Laboratory, New York. A cDNA library was constructed 
in pUC13-Cm according to the method of Gubler and Hoffman, 
(Gene (1983) 25:2 63-269) with slight modifications. RNasin 
was omitted in the synthesis of first strand cDNA as it 

15 interfered with second strand synthesis if not completely 
removed, and dCTP was used to tail the vector DNA and dGTP 
to tail" double-stranded cDNA instead of the reverse as 
described in the paper* The annealed cDNA was transformed 
to competent E * coll JM83 (Messing (1979) supra) cells 

20 according to Hanahan (J. Mol . Biol. (1983) 166:557-580) and 
spread onto LB agar plates (Miller (1972) Experiments in 
Molecular Genetics, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, New York) containing 50 ^g/ml chloramphenicol 
and 0-005% X-Gal. 

25 Identification of Spinach ACP-I cDNA 

A total of approximately 8000 cDNA clones were 
screened by performing Southern blots (Southern, J* Mol. 
Biol. (1975) 98:503) and dot blot (described below) 
hybridizations with clone analysis DNA from 40 pools 

30 representing 200 cDNA clones each (see below) ♦ A 5' end- 
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labeled synthetic oligonucleotide (ACPP4) that is at least 
66% homologous* with a 16 amino acid region of spinach ACP-I 
( 5 ' -GATGTCTTGAGCCTTGTCCTCATCCACATTGATACCAAACTCCTCCTC- 3 ' ) is 
the complement to a DNA sequence that could encode the 16 
5 amino acid peptide glu-glu-glu-phe-gly-ile-asn-val-asp-glu- 
asp-lys-ala-gln-asp-ile, residues 49-64 of spinach ACP-I 
(Kuo and Ohlrogge, Arch. Bioch&m. Biophys. (1984) 234:290- 
296) and eas used for an ACP probe* 

Clone analysis DNA for Southern and dot blot 

10 hybridizations was prepared as follows. Transf ormants were 
transferred from agar plates to LB containing 50 pg/ial 
chloramphenicol in groups of ten clones per 10 ml media. 
Cultures were incubated overnight in a 37 °C shaking 
incubator and then diluted with an equal volume of media and 

15 allowed to grow for 5 more hours. Pools of 200 cDNA clones 
each were obtained by mining contents of 20 samples- DNA 
was extracted from these cells as described by Birnboim and 
Doly (Nucleic Acids Res. (1979) 7:1513-1523). DNA was 
purified to enable digestion with restriction enzymes by 

2 0 extractions with phenol and chloroform followed by ethanol 
precipitation* DNA was resuspended in sterile, distilled 
water and 1 fig of each of the 40 pooled DNA samples was 
digested with £coRI and Hindlll and electrophoresed through 
0.7% agarose gels. DNA was transferred to nitrocellulose 

25 filters following the blot hybridization technique of 
Southern . 

ACPP4 was 5' end-labeled using y* 2 F dATP and T4 
kinase according to the manufacturer's specifications. 
Nitrocellulose filters from Southern blot transfer of clone 
30 analysis DNA were hybridized (24 hours, 42 °C) and washed 

according to Berent et al. (SioTechniques (1985) 3:208-220). 
Dot blots of the same set of DNA pools were prepared by 



- 38 - 



10 



applying 1 ^g of each DNA pool to nylon membrane filters in 
0.5 M NaOH* These blots were hybridized with the probe for 
24 hours at 42 °C in 50* f onnamide/1* SDS/1 M NaCl, and 
washed at room temperature in 2X SSC/0.1% SDS (IX SSC - 
0.15M NaCl; 0.015M Na citrate; SDS-sodium dodecylsulf ate) . 
DNA from the pool which was hybridized by the ACPP4 
oligoprobe was transformed to JM83 cells and plated as above 
to yield individual transf ormants. Dot blots of these 
individual cDNA clones were prepared by applying DNA to 
nitrocellulose filters which were hybridized with the ACPP4 
oligonucleotide probe and analyzed using the same conditions 
as for the Southern blots of pooled DNA samples. 



Nucleotide Sequence Analysis 

The positive clone, pCGNlSOL, was analyzed by 
15 digestion with restriction enzymes and the following partial 
map was obtained* 

pUC13-Cm | -35- 1 248 | -63- j 152 | -200 | 
* * 



20 HHN PXh E SXBSmSsE** 

H-JLtndlll N-tfcol P-PvuII Xh-XhoX 
E-£coRI S-Sall X-XJbal Sm-Saial 

B-BajnHI Ss-SstI *-former PstI site destroyed 

with tailing 

25 **polylinker with available restriction sites indicated 



30 



The cDNA clone was subcloned into pOC118 and pUC119 
using standard laboratory techniques of restriction, 
ligation / transformation, and analysis (Maniatis et al., 
(1982) supra) . Single-stranded DNA template was prepared 
and DNA sequence was determined using the Sanger dideoxy 
technique (Sanger et al. , Proa. Nat. Acad, Sex. USA (1977) 
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74:5463-5467) . Sequence analysis was performed using a 
software package from IntelliGenetics, Inc. 

pCGNlSOL contains an (approximately) 700 bp cDNA 
insert including a stretch of A residues at the 3' terminus 
5 which represents the poly (A) tail of the mRNA. An ATG codon 
at position 61 is presumed to encode the MET translation 
initiation codon. This codon is the start of a 411 
nucleotide open reading frame, of which, nucleotides 229-471 
encode a protein whose amino acid sequence corresponds 
10 almost perfectly with the published amino acid sequence of 
ACP-I of Kuo and Ohlrogge supra as described previously* In 
addition to mature protein, the pCGNlSOL also encodes a 56 
residue transit peptide sequence, as might be expected for a 
nuclear-encoded chloroplast protein. 

15 Nar>in-ACP Construct 

pCGN796 was constructed by ligating pCGNlSOL 
digested with Hindi II -BamHI , pUC8-CM digested with Hindlll 
and BamHI and pUCHS digested with BamHI. The ACP gene from 
pCGN796 was transferred into a chloramphenicol background by 

2 0 digestion with BamHX and ligation with BamHI digested 

pCGN565. The resulting pCGN1902 was digested with JScoRI and 
Smal and ligated to EcoRI-Smal digested pUC118 to give 
PCGN1920. The ACP gene in pCGN1920 was digested at the Ncol 
site, filled in by treatment with the Klenow fragment, 
25 digested^ 5 with Smal and re ligated to form pCGN19l9. This 
eliminated the 5 '-coding sequences from the ACP gene and 
regenerated the ATG. This ACP gene was flanked with PstI 
sites by digesting pCGNl9l9 with EcoRI, filling in the site 
with the Klenow fragment and ligating a PstI linker. This 

3 0 clone is called pCGN945. 

The ACP gene of pCGN945 was moved as BamHI-PstI 
fragment to pUCHS digested with BamHI and PstI to create 



pCGN945a so that a SmaX site (provided by the pUCHS) would 
be at the 5 '-end of the ACP sequences to facilitate cloning 
into the napin cassette pCGN944. pCGN945a digested with 
SmaX and PstX was ligated to pCGN944 digested with Sinai and 
PstI to produce the napin ACP cassette pCGN946. The napin 
ACP cassette was then transferred into the binary vector 
PCGN783 by cloning from the Hindlll site to produce pCGN948. 

Construction of the Binary Vector PCGN783 

pCGN783 is a binary plasmid containing the left and 
right T-DNA borders of A* tumefaciens (Barker et al . , Plant 
Mol * Biol. (1983) 2:335-350); the gentamicin resistance gene 
of pPHUI (Hirsch et al . , Plasmid (1984), 12:139-141) the 
35S promoter of cauliflower mosaic virus (CaMV) (Gardner et 
al., Nucleic. Acids Res. (1981) 9:2871-2890), the kanamycin 
resistance gene of Tn5 (Jorgenson et al., infra and Wolff et 
al., Nucleic Acids Res. (1985) 13:355-367) and the 3' region 
from transcript 7 of pTiA5 (Barker et al., (1983) supra). 

To obtain the gentamicin resistance marker, the 
gentamicin resistance gene was isolated as a 3.1 leb JEJcoRI- 
Pstl fragment of pPHIJl cloned into pUC9 yielding pCGN54 9. 
The Hindlll-JBamHI fragment containing the gentamicin 
resistance gene was substituted for the HindXXX-BglXX 
fragment of pCGN587 creating pCGN594. 

pCGN587 was prepared as follows: The HindXXX-SmaX 
fragment of Tn5 containing the entire structural gene for 
APffll (Jorgenson et al, , Mol. Gen. Genet. (1979) 177:65) was 
cloned into pUC8 (Vieira and Messing, Gene (1982) 19:259), 
converting the fragment into a HindXXX-EcoRX fragment, since 
there is an £coRI site immediately adjacent to the SmaX 
site. The Pstl-EcoRX fragment containing the 3 '-portion of 
the APJ5TII gene was then combined with an .EcoRI-BamHI-Sall- 
PstX linker into the EcoRX site of pOC7 (pCGN546W) . Since 



this construct does not confer kanamycin resistance, 
kanamycin resistance was obtained by inserting the Bgl II- 
PstI fragment of the APHTI gene into the BaaHl-Pstl site 
(PCGN54 6X) . This procedure reassembles the APffll gene, so 
that EcoRI sites flank the gene. An ATG codon was upstream 
from and out of reading frame with the ATG initiation codon 
of APHII. The undesired ATG was avoided by inserting a 
Sau3A-PstI fragment from the 5 '-end of APH1I, which fragment 
lacks the superfluous ATG, into the BamHI-PstI site of 
pCGN54 6W to provide plasmid pCGN550. 

The £coRI fragment containing the APS11 gene was 
then cloned into the unique -EcoRI site of pCGN451, which 
contains an octopine synthase cassette for expression, to 
provide pCGN552 (1ATG) . 

pCGN4 5 1 includes an octopine cassette which contains 
about 1556 bp of the 5' rton-coding region fused via an BcoRI 
linker to the 3 ' non-coding region of the octopine synthase 
gene of pTiA6. The pTi coordinates are 11,207 to 12,823 for 
the 3' region and 13,643 to 15,208 for the 5' region as 
defined by Barker et al . , Plant Mol * Biol: (1983) 2:325. 

The 5' fragment was obtained as follows. A small 
subcloned fragment containing the 5' end of the coding 
region, as a BajnHI-.£coRI fragment was cloned in pBR322 as 
plasmid pCGN407. The SamHI-^coRI fragment has an XmnI site 
in the coding region, while pBR322 has two Xmzxl sites. 
PCGN407 was digested with XmnI, resected with Bal31 nuclease 
and JEcoRI linkers added to the fragments* After £coRI and 
BamHI digestion, the fragments were size fractionated, the 
fractions cloned and sequenced* In one case, the entire 
coding region and 10 bp of the 5' non -translated sequences 
had been removed leaving the 5' non-translated region, the 
mRNA cap site and 16 bp of the 5' non-translated region (to 
a BamHI site) intact. This small fragment was obtained by 



size fractionation on a 7% acrylamide gel and fragments 
approximately 130 bp long eluted. 

This size fractionated DNA was ligated into M13mp9 
and several clones sequenced and the sequence compared to 
the known sequence of the octopine synthase gene. The K13 
construct was designated pl4, which plasmid was digested 
with BamHI and .EcoRI to provide the small fragment which was 
ligated to a Xhol to BamKI fragment containing upstream 5' 
sequences from pTiA6 (Garfinkel and Nester, J. Bacterial. 
(198 0) 144:732) and to an EcdKL to Xhol fragment containing 
the 3 ' sequences . 

The resulting Xhol fragment was cloned into the Xhol 
site of a pUC8 derivative , designated pCGN426. This plasmid 
differs from pUC8 by having the sole EcoRI site filled in 
with DNA polymerase I, and having lost the PstI and Hindlll 
site by nuclease contamination of Hindi restriction 
endonuclease, when a Xhol linker was inserted into the 
unique Hindi site of pUC8. The resulting plasmid pCGN451 
has a single EcoRI site for the insertion of protein coding 
sequences between the 5' non-coding region (which contains 
1,550 bp of 5' non-transcribed sequence including the right 
border of the T-DNA, the mRNA cap site and 16 bp of 5 ' non- 
translated sequence) and the 3' region (which contains 2 67 
bp of the coding region, the stop codon, 196 bp of 3' non- 
translated DNA, the polyA site and 1,153 bp of 3' non- 
transcribed sequence) . pCGN451 also provides the right T- 
DNA border. 

The resulting plasmid pCGN451 having the ocs 5' and 
the ocs 3 ' in the proper orientation was digested with ZcoRI 
and the EcoRI fragment from pCGN55l containing the intact 
kanamycin resistance gene inserted into the £coRI site to 
provide pCGN552 having the kanamycin resistance gene in the 
proper orientation. 



This ocs/KAN gene was used to provide a selectable 
marker for the trans type binary vector pCGN587. 

The 5' portion of the engineered octopine synthase 
promoter cassette consists of pTiAS DNA from the Xhol at bp 
5 15208-13644 (Barker's numbering) , which also contains the T- 
DNA boundary sequence (border) implicated in T-DNA transfer. 
In the plasmid pCGN587, the ocs/KAN gene from pCGN552 
provides a selectable marker as well as the right border. 
The left boundary region was first cloned in M13mp9 as a 

10 ffindlll-Smal piece (pCGNS02) (base pairs 602-2213) and 
recloned as a KpnI-EcoRI fragment in pCGN565 to provide 
pCGN580. pCGN565 is a cloning vector based on pUC8-Cm, but 
containing pUClS linkers. pCGN580 was linearized with SamHI 
and used to replace the smaller Bglll fragment of pVCK102 

15 (Knauf and Nester, Plasmid (1982) S:45), creating pCGN585. 
By replacing the smaller Sail fragment of pCGN585 with the 
Xhol fragment from pCGN552 containing the ocs/KAN gene, 
pCGN587 was obtained. 

The pCGN594 jfijidlll-BamHI region, which contains an 

20 5 ' -ocs-kanamycin-ocs-3 ' (ocs is octopine synthase with 5 f 
designating the promoter region and 3' the terminator 
region, see U.S. application serial no. 775,923, filed 
September 13, 1985) fragment was replaced with the Hindlll- 
BamHI polylinJcer region from pUC18. 

25 pCGN566 contains the £coRI-ff±ndIII linker of pUC18 

inserted into the ZcoRI-Hindlll sites of pUC13-Cm. The 
Hindlll-Bglll fragment of pNW31C-8 , 29-1 (Thomashow et al., 
Cell (1980) 19:729) containing ORP1 and -2 of pTiA6 was 
subcloned into the Hindlll-BajrzHI sites of pCGN566 producing 

30 pCGN703 ♦ 

The 5au3A fragment of pCGN703 containing the 3' 
region of transcript 7 {corresponding to bases 2396-2920 of 
pTiA6 (Barker et al*, (1983) supra) was subcloned into the 



BamHI site of pUC18 producing pCGN709. The ScoRI-Smal 
polylinker region of pCGN709 was substituted with the ScoRI- 
Smal fragment of pCGN587, which contains the kanamycin 
resistance gene (APE3-II) producing pCGN726. 
5 The ScoRI-Sall fragment of pCGN726 plus the Bglll- 

ZcoRI fragment of pCGN734 were inserted into the BamHI-Sall 
site of pUCS-Ca producing pCGN738» pCGK726c is derived from 
pCGN738 by deleting the 900 bp ScoRI-£coRI fragment. 

To construct pCGN167, the AZuI fragment of CaMV (bp 

10 7144-7735) (Gardner et a2 . , Nucl * Acid Res. (1981) 9:2871- 
2888) was obtained by digestion with AZuI and cloned into 
the Hindi site of M13mp7 (Messing et al w Nucl. Acids Res. 
(1981) 9:309-321) to create C614. An ZcoRI digest of C614 
produced the EcoRI fragment from C614 containing the 35S 

15 promoter • which was cloned into the J?coRI site-of pUC8 

(Vieira and Messing, Genh (1982) 19:259) to produce pCGNl46. 

To trim the promoter region, the Bgill site (bp 
7670) was treated with BglTl and resected with Sal 31 and 
subsequently a BglXX linker was attached to the Sal 31 

20 treated DNA to produce pCGN147. 

pCGN14 8a containing a promoter region, selectable 
marker (KAN with 2 ATG's) and 3' region, was prepared by 
digesting pCGN528 with Bglll and inserting the BamHI-Bglll 
promoter fragment from pCGN147. This fragment was cloned 

25 into the BglXX site of pCGN528 so that the BglXX site was 
proximal to the kanamycin gene of pCGN528* 

The shuttle vector used for this construct, pCGN528, 
was made as follows. pCGN525 was made by digesting a 
plasmid containing Tn5 which harbors a kanamycin gene 

30 (Jorgenson et al . , Hoi • Gen. Genet. (1979) 177:65) with 
ffindlll-BamHI and inserting the £f±ndIII-BamHI fragment 
containing the kanamycin gene into the Hindi II-BajnHI sites 
in the tetracycline gene of pACYC184 (Chang and Cohen, J. 
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Bact&riol. (1978) 234:1141-1156). pCGN526 was made by 
inserting the BamHI fragment 19 of pTiA6 (Thomashow et al . , 
Cell (1980) 19:729-739)/ modified with Xhol linkers inserted 
into the Sinai site, into the BamHI site of pCGN525. pCGN528 
5 was obtained by deleting the small Xhol fragment from 
pCGN526 by digesting with Xhol and religating. 

pCGN149a was made by cloning the BamHI -kanamyc in 
gene fragment from pMB9KanXXI into the BamHI site of 
pCGN148a. 

10 pMB9KanXXI is a pUC4K variant (Vieira and Messing, 

Gene (1982) 19:259-268) which has the Xhol site missing but 
contains a functional kanamycin gene from Tn903 to allow for 
efficient selection in Agrobacterium . 

pCGN!49a was digested with Bglll and Sphl. This 

15 small BgIII-£phI fragment of pCGN14 9a was replaced with the 
BamHI-Sphl fragment from* MI (see below) isolated by 
digestion with BamHI and Sphl. This produces pCGN167, a 
construct containing a full length CaMV promoter, 1ATG- 
kanamycin gene, 3' end and the bacterial Tn903-type 

20 kanamycin gene. MI is an EcoRI fragment from pCGN54 6X (see 
construction of pCGN587) and was cloned into the EcoRI 
cloning site of M13mp9 in such a way that the PstI site in 
the lATG-kanamycin gene was proximal to the polylinker 
region of Ml3mp9. 

25 The #ii3dIII-BamHI fragment in the pCGN167 containing 

the CaMV-35S promoter, 1ATG -kanamyc in gene and the BamHI - 
fragment 19 of pTiA6 was cloned into the BamHI -Hindi 1 1 sites 
of pUC!9 creating pCGN976. The 35S promoter and 3' region 
from transcript 7 was developed by inserting a 0.7 kb 

30 HlndllX-EcoRI fragment of pCGN976 ( 3 5S* promoter) and the 0.5 
kb EcoRI -Sail fragment of pCGN709 (transcript 7:3') into the 
HindXII-Sall sites of pCG^566 creating pCGN766c. 
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The 0.7 kb HlndXII-EcoRI fragment of pCGN766c (CaMV- 
35S promoter) was ligated to the 1.5 kb ScoRI-Sall fragment 
in pCGN726c ( 1ATG-KAN 3' region) followed by insertion into 
the ffindlll-Sall sites of pUC119 to produce pCGN778. The 
5 2*2 kb region of pCGN778, ifindlll-Sall fragment containing 
the CaMV~35S promoter and 1ATG-KAN-3' region was used to 
replace the ifindlll-Sall linker region of pCGN739 to produce 
pCGN783. 

Transfer of the Binary Vector PCGN948 into Aqroba cte-riuin 
10 pCGN948 was introduced into Agrobacterium 

tumefaciens EHA101 (Hood et ai., J. Bacterid. (1985) 
168:1291-1301) by transformation. An overnight 2 ml culture 
of EHA101 was grown in MG/L broth at 30°C, 0.5 ml was 
inoculated into 100 ml of MG/L broth (Garfinkel and Nester, 
15 J. Bacterid. (1980) 144 T732-743) and grown in a shaking 
incubator for 5 h at 30 °C. The cells were pelleted by 
centrifugation at 7K, resuspended in 1 ml of MG/L broth and 
placed on ice. Approximately, 1 &g of pCGN948 DNA was 
placed in 100 /xl of MG/L broth to which 200 m! of the EHA101 

2 0 suspension was added; the tube containing the DNA-cell mix 

was immediately placed into a dry ice/ethanol bath for 5 
minutes. The tube was quick thawed by 5 minutes in 3 7°C 
water bath followed by 2 h of shaking at 30 °C after adding 
1 ml of fresh MG/L medium. The cells were pelleted and 
25 spread onto MG/L plates (1.5* agar) containing 100 mg/1 
gentamicin. Piasmid DNA was isolated from individual 
gentamicin-resistant colonies, transformed back into E. 
coli, and characterized by restriction enzyme analysis to 
verify that the gentamicin-resistant EHA101 contained intact 

3 0 copies of pCGN948. single colonies are picked and purified 

by two more streakings on MG/L plates containing 100 mg/1 
gentamicin. 
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Transformation and Regeneration of B. Nanus 

Seeds of Brassier napus cv Westar were soaked in 95% 
ethanol for 4 minutes* They were sterilized in 1% solution 
of sodium hypochlorite with 50 jul of "Tween 20" surfactant 
per 100 ml sterile solution. After soaking for 45 minutes , 
seeds were rinsed 4 times with sterile distilled water. 
They were planted in sterile plastic boxes 7 cm wide, 7 cm 
long, and 10 cm high (Magenta) containing 50 ml of l/10th 
concentration of MS (Murashige minimal organics medium, 
Gibco) with added pyridoxine (50 pqfl) , nicotinic acid (50 
#9/1) f glycine (200 M9/1) **>d solidified with 0*6% agar. 
The seeds germinated and were grown at 22 °C in a 16h~8h 
light-dark cycle with the light intensity approximately 65 
MEm~ 2 s' 1 . After 5 days the seedlings were taken under sterile 
conditions and the hypocotyls excised and cut into pieces of 
about 4 mm in length. The hypocotyl segments were placed on 
a feeder plate or without the feeder layer on top of a 
filter paper on the solidified B5 0/1/1 or B5 0/1/0 medium. 
B5 0/1/0 medium contains B5 salts and vitamins (Gamborg, 
Miller and Ojima, Experimental Cell Res. (1968) 50:151-158), 
3% sucrose, 2 , 4-dichlorophenoxyacetic acid (1.0 mg/1) , pH 
adjusted to 5.8, and the medium is solidified with 0.6% 
Phytagar; B5 0/1/ 1 is the same with the addition of l.o mg/1 
kinetin. Feeder plates were prepared 24 hours in advance by 
pipetting 1.0 ml of a stationary phase tobacco suspension 
culture- (maintained as described in Fillatti et al., 
Afolecular General Genetics (1987) 205:192-199) onto B5 0/1/0 
or B5 0/1/1 medium. Hypocotyl segments were cut and placed 
on feeder plates 24 hours prior to Acfrobacterium treatment. 

Agrobacterium tumefaciens (strain EHA101 x 948) was 
prepared by incubating a single colony of Agrobacterium in 
MG/L broth at 30°C. Bacteria were harvested 16 hours later 



and dilutions of 10 s bacteria per ml were prepared in MG/L 
broth. Hypocotyl segments were inoculated with bacteria by 
placing the segments in an Agrrobacterium suspension and 
allowing them to set for 30-60 minutes, then removing and 
transferring to Petri plates containing BS 0/1/ i or 0/1/0 
medium (0/1/1 intends 1 mg/1 2,4-D and 1 mg/1 kinetin and 
0/1/0 intends no kinetin) * The plates were incubated in low 
light at 22 *C. The co-incubation of bacteria with the 
hypocotyl segments took place for 24-48 hours* The 
hypocotyl segments were removed and placed on B5 0/1/ 1 or 
0/1/0 containing 500 mg/1 carbenicillin (kanamycin sulfate 
at 10, 25, or 50 mg/1 was sometimes added at this time) for 
7 days in continuous light (approximately 65 juEWT 2 S" 1 ) at 
22 °C. The segments were transferred to B5 salts medium 
containing 1% sucrose, 3 mg/1 benzylamino purine (BAP) and 1 
mg/1 zeatin. This was supplemented with 500 mg/1 
carbenicillin, 10 , 25, or 50 mg/1 kanamycin sulfate, and 
solidified with 0*6% Phytagar (Gibco) . Thereafter, explants 
were transferred to fresh medium every two weeks. 

After one month green shoots developed from green 
calli which were selected on media containing kanamycin. 
Shoots continued to develop for three months. The shoots 
were cut from the calli when they were at least 1 cm high 
and placed on B5 medium with 1% sucrose, no added growth 
substances, 300 mg/1 carbenicillin, and solidified with 0.6% 
phytagar-. The shoots continued to grow and several leaves 
were removed to test for neomycin phosphotransferase II 
(NPTII) activity* Shoots which were positive for NPTII 
activity were placed in Magenta boxes containing B5 0/1/1 
medium with 1% sucrose, 2 mg/1 indolebutyric acid, 200 mg/1 
carbenicillin, and solidified with 0.6% Phytagar. After a 
few weeks the shoots developed roots and were transferred to 



soil. The plant were grown in a growth chamber at 22 °C in a 
16-8 hours light-dark cycle with light intensity 220 /iEM" 2 s -1 
and after several weeks were transferred to the greenhouse. 



Southern Data 

5 Regenerated S, napus plants from cocultivations of 

Agrobacterium tumefaciens EHA101 containing pCGN948 and B. 
napus hypocotyls were examined for proper integration and 
embryo-specif ic expression of the spinach leaf ACP gene. 
Southern analysis was performed using DNA isolated from 

10 leaves of regenerated plants by the method of Dellaporta et 
al. (Plant Mol. Biol. Rep. (1983) 1:19-21) and purified once 
by banding in CsCl. DNA (10 nq) was digested with the 
restriction enzyme £coRI, electrophoresed on a 0.7% agarose 
gel and blotted to nitrocellulose (see Maniatis et al., 

15 (1982) supra.)* Blots were probed with pCGN945 DNA 

containing 1.8 kb of the spinach ACP sequence or with the 
ScoRI-Hindlll fragment isolated from pCGN936c (made by 
transferring the Hindlll^ScoRI fragment of pCGN930 into 
pCGN566) containing the napin 5' sequences labeled with 32 P- 

20 dCTP by nick translation (described by the manufacturer, BRL 
Nick Translation Reagent Kit, Bethesda Research 
Laboratories, Gaithersburg, MD) ♦ Blots were prehybridi2ed 
and hybridized in 50% formamide, lOx Denhardt's, 5xSSC, 0*1% 
SDS, 5 mM EDTA, 100 /xg/ml calf thymus DNA and 10% dextran 

25 sulfate "(hybridization only) at 42 °C. (Reagents described 
in Maniatis et al. r (1982) supra.) Washes were in IxSSC, 
0.1% SDS, 3 0 min and twice in O.lxSSC, 0.1% SDS 15 min each 
at 55°C. 

Autoradiograms showed two bands of approximately 3*3 
30 and 3-2 kb hybridized in the EcoRl digests of DNA from four 
plants when probed with the ACP gene (pCGN945) indicating 
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proper integration of the spinach leaf ACP construct in the 
plant genome since 3.3 and 3.2 kb EcoRI fragments are 
present in the T-DNA region of pCGN94S. The gene construct 
was present in single or multiple loci in the different 
5 plants as judged by the number of plant DNA-construct DNA 
border fragments detected when probed with the napin 5' 
sequences . 

Northern Data 

Expression of the integrated spinach leaf ACP gene 

10 from the napin promoter was detected by Northern analysis in 
seeds but not leaves of one of the transformed plants shown 
to contain the construct DNA. Developing seeds were 
collected from the transformed plant 21 days postanthesis. 
Embryos were dissected from the seeds and frozen in liquid 

15 nitrogen. Total RNA wa'fc isolated from the seed embryos and 
from leaves of the transformed plant by the method of Crouch 
et al., (1983) supra, electrophoresed on formaldehyde- 
containing 1.5% agarose gels as described (Shewmaker et al., 
Virology (1985) 140:281-288) and blotted to nitrocellulose 

20 (Thomas, Proc. Natl. Acad. Sci. USA (1980) 77:5201-5205). 
Blots were prehybridized, hybridized, and washed as 
described above. The probe was an isolated Pstl-BamHI 
fragment from pCGN945 containing only spinach leaf ACP 
sequences labeled by nick translation. 

25 -An RNA band of -0.8 kb was detected in embryos but 

not leaves of the transformed plant indicating seed-specific 
expression of the spinach leaf ACP gene. 

Example 3 

Construction of B. Carapestris Napin Promoter Cassette 
3 0 A Bglll partial genomic library of B. camp&stris DNA 

was made in the lambda vector Charon 35 using established 
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protocols (Maniatis et a2 . , (1982) supra). The titer of the 
amplified library was -1.2 x 10* phage/ml. Four hundred 
thousand recombinant bacteriophage were plated at a density 
of 10 3 pre 9 x 9 in- NZY plate (NZYM as described in 
5 Maniatis et al., (1982) supra) in NZY + 10 mM MgSo 4 + 0.9% 
agarose after adsorption to DH1 E. coli cells (Hanahan, Mol. 
Biol. (1983) 156:557) for 20 min at 3 7°C Plates were 
incubated at 37°c for -13 hours, cooled at 4°C for 2.5 hours 
and the phage were lifted onto Gene Screen Plus (New England 

10 Nuclear) by laying precut filters over the plates for 

approximately 1 min and peeling them off. The adsorbed 
phage DNA was immobilized by floating the filter on 1,5 M 
NaCl, 0-5 M NaOH for 1 min., neutralizing in 1-5 M NaCl, 0.5 
M Tris-HCl, pH 8.0 for 2 min and 2XSSC for 3 min. Filters 

15 were air dried until just damp, prehybridized and hybridized 
at 42 °C as described for Southern analysis. Filters were 
probed for napin-containing clones using an Xhol-Sall 
fragment of the cDNA clone BE5 which was isolated from the 
B. campestris seed cDNA library described using the probe 

20 pNl (Crouch et al . , (1983) supra). Three plaques were 

hybridized strongly on duplicate filters and were plaque 
purified as described (Maniatis et al . , (1982) supra). 

One of the clones named lambda CGN1-2 was 
restriction mapped and the napin gene was localized to 

25 overlapping 2.7 kb Xhol and 2.1 kb Sail restriction 

fragments. The two fragments were subcloned from lambda 
CGN1-2 DNA into pCGN78 9 (a pUC based vector the same as 
PUC119 with the normal polylinker replaced by the synthetic 
linker - 5' GGAATTCGTCGACAGATCTCTGCAGCTCGAGGGATCCAAGCTT 3' 

30 (which represents the polylinker jEcoRI, Sail, Bgl!!, PstX, 
Xhol, BamHI, Hindu I) . The identity of the subclones as 
napin was confirmed by sequencing. The entire coding region 
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sequence as well as extensive 5' upstream and 3' downstream 
sequences were determined (Figure 2) . The lambda CGN1-2 
nap in gene is that encoding the mRNA corresponding to the 
BE5 cDNA as determined by the exact match of their 
5 nucleotide sequence. 

An expression cassette was constructed from the 5'- 
end and the 3 '-end of the lambda CGN1-2 napin gene as 
follows in an analogous manner to the construction of 
pCGN944. The majority of the napin coding region of pCGN940 

10 was deleted by digestion with Sail and religation to form 
pCGN1800. Single-stranded DNA from pCGN1800 was used in an 
in vitro mutagenesis reaction (Adelman et al», D&A {1983) 
2:183-193) using the synthetic oligonucleotide 5' 
GCTTGTTCGCCATGGATATCTTCTGTATGTTC 3 ' . This oligonucleotide 

15 inserted , an EcoRV and an tfcol restriction site at the 

junction of the promoter' region and the ATG start codon of 
the napin gene. An appropriate mutant was identified by 
hybridization to the oligonucleotide used for the 
mutagenesis and sequence analysis and named pCGNl80l. 

2 0 A 1.7 kb promoter fragment was subcloned from 

pCGN18 0l by partial digestion with ScoRV and ligation to 
pCGN78 6 (a pCGN566 chloramphenicol based vector with the 
synthetic linker described above in place of the normal 
poly linker) cut with EcoRI and blunted by filling in with 
25 DNA Polymerase I Klenow fragment to create pCGN1802. 3' 
sequences from the lambda CGN1-2 napin gene were added to 
XhoI-Hin&III digested pCGN1802 from pCGN941 digested with 
Xhol and fr±ndIII. The resulting clone, pCGN1803, contains 
approximately 1.6 kb of napin 3 '-sequences as well as 

3 0 promoter sequences, but a 3 26 nucleotide Hindlll fragment 

normally found at the 3 ' -end of lambda CGN1-2 is inserted 
opposite to its natural orientation. As a result, there are 
two ftiJidllX sites in pCGN1803. This reversed fragment was 
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removed by digestion of pCGN1803 with Hindlll. Following 
religation, a clone was selected which now contained only 
approximately 1*25 kb of the original 1.6 nap in" 3 ' -sequence. 
This clone, pCGN1808, is the lambda CGN1-2 expression 
5 cassette and contains 1.725 kb of napin promoter sequence, 
and 1*265 kb of napin 3' sequences with the unique cloning 
sites Sail, Bgrll, P&tl, and Xhol in between . Any sequence 
that requires seed-specific transcription or expression in 
Brassica, for example, a fatty acid gene, can be inserted in 
10 this cassette in a manner analogous to that described for 
spinach leaf ACP and the S. napus napin cassette (see 
Example 2) . 

PCGN3 223 Naoin Expression Cassette 

pCGN1808 is modified to contain flanking restriction 

15 sites to. allow movement of only the expression sequences and 
not the antibiotic resistance marker to binary vectors such 
as pCGN1557 (McBride and Summerfelt (1990) PI. Moi . Biol. 
1£: 269-276) . Synthetic oligonucleotides containing Kpnl, 
NotI and Bindlll restriction sites are annealed and ligated 

20 at the unique fTindlll site of pCGN1808, such that only one 
Hindlll site is recovered. The resulting plasmid, pCGN3 2 00 
contains unique ffindlll, NotI and Kpnl restriction sites at 
the 3 '-end of the napin 3 ' -regulatory sequences as confirmed 
by sequence analysis. 

25 The majority of the napin expression cassette is 

subcloned from pCGN3 200 by digestion with Hindlll and Sad 
and ligation to ffindlll and Sad digested pIC19R (Marsh, et 
al. (1984) Gene 32:481-485) to make pCGN3212. The extreme 
5 '-sequences of the napin promoter region are reconstructed 

3 0 by PCR using pCGN3 2 00 as a template and two primers flanking 
the SacI site and the junction of the napin 5 ' -promoter and 
the pUC backbone of pCGN3 2 00 from the pCGN1808 construct. 
The forward primer contains Clal, J3indIII, tfotl, and Kpnl 



restriction sites as well as nucleotides 408-423 of the 
napin 5' -sequence (from the EcoRV site) and the reverse 
primer contains the complement to napin sequences 718-739 
which include the unique Sad site in the 5' -promoter . The 
5 PGR was performed using in a Perkin Elmer/ Cetus thermocycler 
according to manufacturer's specifications* The PCR 
fragment is subcloned as a blunt-ended fragment into pUC8 
(Vieira and Messing (1982) Gene 19:259-268) digested with 
Hindi to give pCGN3217. Sequence of pCGN3217 across the 

10 napin insert verifies that no improper nucleotides were 

introduced by PCR. The napin 5 -sequences in pCGN3217 are 
ligated to the remainder of the napin expression cassette by 
digestion with Clal and Sad and ligation to pCGN3212 
digested with Clal and Sad. The resulting expression 

15 cassette pCGN3221, is digested with Hindlll and the napin 
expression sequences are gel purified away and ligated to 
pIC2 0H (Marsh, supra) digested with Hindi II. The final 
expression cassette is pCGN3223, which contains in an 
ampicillin resistant background, essentially identical 1.725 

20 napin 5' and 1,265 3' regulatory sequences as found in 

PCGN1808. The regulatory regions are flanXed with Hindlll, 
NotI and Kpnl restriction sites and unique Sail, Bgllll, 
PstX, and Xhol cloning sites are located between the 5' and 
3' noncoding regions. 

25 . Example 4 

Isolation of other Seed Specific Promoters 
Other seed-specific promoters may be isolated from 
genes encoding proteins involved in seed triacylglycerol 
synthesis, such as acyl carrier protein from Brassica seeds. 
3 0 Immature seeds were collected from Brassica campestris cv. 
"R-SOQ," a self -compatible variety of turnip rape. Whole 
seeds were collected at stages corresponding approximately 



to 14 to 28 days after flowering. RNA isolation and 
preparation of a cDNA bank was as described above for the 
isolation of a spinach ACP cDNA clone except the vector 
used was pCGN565. To probe the cDNA bank, the 
5 oligonucleotide (5' ) -ACTTTCTCAACTGTCTCTGGTTTAGCAGC- (3 ') was 
synthesized using an Applied Biosystems DNA Synthesizer, 
model 380A, according to manufacturer's recommendations. 
This synthetic DNA molecule will hybridize at low 
stringencies to DNA or RNA sequences coding for the amino 

10 acid sequence (ala-ala-lys-pro-glu-thr-val-glulys-val) . 

This amino acid sequence has been reported for ACP isolated 
from seeds of Brassica napus (Slabas et al., 7th 
International Symposium of the Structure and Function of 
Plant Lipids, University of California, Davis, CA, 1986) ; 

15 ACP from, B . campestris seed is highly homologous. 

Approximately 2200 different cDNA clones were analyzed using 
a colony hybridization technique (Taub and Thompson, Anal. 
Biochem. (1982) 126:222-230) and hybridization conditions 
corresponding to Wood et al . (Proc. Natl* Acad. Sci. (1985) 

20 82:1585-1588). DNA sequence analysis of two cDNA clones 

showing obvious hybridization to the oligonucleotide probe 
indicated that one, designated pCGNlBcs, indeed coded for an 
ACP -precursor protein by the considerable homology of the 
encoded amino acid sequence with ACP proteins described from 

25 Brassica napus (Slabas et al . , 1980 supra). Similarly to 

Example^ 3 , the ACP cDNA clone, pCGNlBCS, was used to isolate 
ACP genomic clones containing the regulatory information for 
expression of ACP during triacylglyceride synthesis in the 
seeds, DNA was isolated from B. campestris cv. R500 young 

3 0 leaves by the procedure of Scofield and Crouch (J. 

Biol .Ch&m. (1987) 2 62:12202-12208). A Sau3A partial genomic 
library of the B. campestris DNA was made in the lambda 
vector Embl 3 (Stratagene, San Diego, CA) using established 
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protocols (Maniatis et al., (1982) supra) and manufacturer's 
instructions. The titer of the library was -1.0x10 s 
phage/ml. Six hundred thousand recombinant bacteriophage 
were plated and screened as described in Example 3 with the 
5 exception that the E. coll host cells used were strain P2392 
(Stratagene, San Diego, CA) . Filters were prehybridized and 
hybridized at 42 °C in 25 ml each of hybridization buffer 
containing 50* formamide, 10X Denhardt's, 5X SSC, 5 Mm EDTA, 
0.1% SDS, and 100 jig/ml denatured salmon sperm DNA (reagents 

10 described in Maniatis et al., (1982) supra). The probe used 
in these hybridizations was 0.2 fig of a nick-translated 53 0 
base pair Bglll-Dral fragment of pCGNlBcs, the B. campostris 
ACP cDNA clone described above* Six plaques were hybridized 
strongly on duplicate filters after washing the filters at 

15 55 °C in 0.1X SSC/0.2% SDS, and were plaque-purified as 
described (Maniatis et a£ . , (1982) supra). 

Restriction analysis followed by Southern 
hybridization was performed on some of the clones using the 
hybridization conditions and radiolabeled probe described 

20 above. One clone, Bcg4-4, contains the ACP gene on two 

overlapping restriction fragments, an -5.1 kb SstI fragment 
and an -1.2 kb Hindlll fragment. These restriction 
fragments were subcloned into the cloning vector pCGN565, 
The DNA sequence of some regions of the subclones verified 

25 by homology that Bcg4-4 is an ACP gene. The sequence also 
shows that this particular ACP gene is expressed in plants, 
as the sequence in the coding region matches exactly the 
sequence of the PCGNLBcs ACP cDNA except for three regions. 
These regions are believed to be intervening sequences, a 

30 common element of eukaryotic genes that is spliced out 
during processing of mRNA (Padgett et al., Ann. Rev. 
Biochem. (1986) 55:1119-1150). Further restriction mapping 
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of the SstI subclone identified an Xhol fragment containing 
-I* 5 kb of 5 ' sequence upstream from the Xhol site near the 
5' end of the PCGNLBCS cDNA clone r This Xhol fragment was 
subcloned in opposite orientations in the cloning/ sequencing 
vector Bluescript + (Stratagene, San Diego, CA) and the 
clones were designated pCGN1941 and pCGN1941'. DNA 
sequencing of l kb of the DNA upstream of the coding region 
was completed. Also, the complete sequence of the 1.2 kb 
Hln&IU subclone described above was determined. The DNA 
sequence derived from the clones described above is shown in 
Figure 3. Additional sequences at the 3' end of the ACP 
gene were subcloned on an -1.6 kb Sstl-Bg-lII fragment into 
Bluescript + and Bluescript - (clones are designated 
PCGN1940 and pCGNl940')- The SstI site in these clones is 
the one found at the 3' end of the ACP coding region of 
PCGNlBcs. 

An expression cassette can be constructed from the 
5' upstream sequences and 3' downstream sequences of Bcg4-4 
as follows* The PCGN1941 Xhol subclone is used for the 5' 
regulatory region. This clone contains the Xhol insert in 
the opposite orientation of the lacZ gene. The 3' 
regulatory region is altered to allow cloning as a Pstl- 
Bglll fragment into PCGN565 by oligonucleotide site-directed 
mutagenesis. Single-stranded DNA is made from pCGN1940 and 
altered by mutagenesis as described (Adelman et al . , supra) 
with the synthetic oligonucleotide 5' 
CTTAAGAAGTAACCCGGGCTGCAGTTTTAGTATTAAGAG 3 ' . This 
oligonucleotide provides Smal and PstI restriction sites 
just after the TAA stop codon of the pCGNlBcs cDNA. The 
Pstl-Bgrlll 3' fragment is then cloned into the PstI and 
BamHI sites (the BamHI restriction site is destroyed in this 
process) of PCGN565. The resulting clone is digested with 
PstI and Smal, and the fragment inserted into the 
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corresponding sites in PCGN1941 (described above) in the 
same orientation as the 5' region. The resulting clone 
comprises the ACP expression cassette with PstX, EcoRI, and 
EcoRV sites available between the 5' and 3' regulatory 
5 regions for the cloning of genes to be expressed under the 
regulation of these ACP gene regions • 

Example 5 

Isolation of Seed-specific cDNA Clone, EA9 
Ninety-six clones from the 14-28 day postanthesis 3. 

10 aampestris seed cDNA library (described in the previous 

example) were screened by dot blot hybridization of miniprep 
DNA on Gene Screen Plus nylon filters (NEN Research 
Products, Boston, MA) . The probes used were radioactively 
labeled first-strand synthesis cDNAs made from the day 14-28 

15 postanthesis seed mRNA or from B . campestris leaf KRNA* 

Clones which hybridized strongly to seed cDNA and little or 
not at all to leaf cDNA were catalogued- A number of clones 
were identified as representing the seed storage protein 
napin by cross-hybridization with an Xhol-Sall fragment of 

20 pNI (Crouch et al . , (1983) supra), a B. napus napin cDNA, 
One of these napin clones, BE5, was used in Example 3 to 
identify a 8, campestris genomic clone as a source of an 
embryo-specif ic promoter . 

Another abundant class of cDNA clones were those 

25 represented by a clone designated EA9 . EA9 cross-hybridized 
to seven other cDNA clones of 600 cDNAs screened by dot blot 
hybridization and was highly expressed in seeds and not in 
leaves. Northern blot analysis of mRNA isolated from day 14 
postanthesis whole seed, and day 21 and 28 postanthesis 

30 embryos using a 7 00 bp £coRI fragment of EA9 (see below) as 
a probe shows that EA9 is highly expressed at day 14 and 
expressed at a much lower level at day 21 and day 28 



postanthesis. Because the embryo is so small at day 14, it 
was suspected that the predominant expression of EA9 might 
be in a tissue other than the embryo* Total RNA was 
isolated (Crouch et al., (1983) supra) from whole seed (14, 
5 15, 17 and 19 days postanthesis), seed coats (day 14 and day 
21 postanthesis) and embryos (day 21 postanthesis) * Twenty- 
five ji<3 of each sample were analyzed by Northern blot 
analysis as described in Example 2. The probe used was a 
0*7 kb -EcoRI DNA fragment isolated from the EA9 cONA and 

10 labeled by nick-translation* The results of the Northern 
analysis showed the EA9 RNA was detected in whole seed at 
all times tested and in seed coats, but not in the embryo* 
A separate Northern analysis of whole seed RNA from days 13 
through day 31 postanthesis (in two day intervals) indicated 

15 that EA9,was highly expressed between days 13 to 21 but was 
barely detectable by day 27 postanthesis* 

In Situ Hybridization 

Seed-coat specific expression of EA9 was confirmed 
by in situ hybridization analysis* Day 14 and 21 

20 postanthesis whole seeds of B * campestris were fixed in a 4% 
paraformaldehyde phosphate buffered saline (PBS) solution* 
The tissue was then dehydrated through a graded tertiary- 
butyl alcohol (TBA) series, infiltrated with paraplast and 
cast into paraffin blocks for sectioning (Berlyn and 

25 Miksche* Botanical Microtechnique ajid Cytochemistry (1976), 
Iowa State University Press) , Five ^m longitudinal sections 
of the embedded seeds (one cell- layer thickness) were 
generated on a Reichert Histostat rotary microtome. The 
paraffin ribbons containing the seed sections were then 

3 0 affixed to gelatin-chrome alum subbed slides (Berlyn 
Miksche, (1976) supra). 
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Single-stranded radiolabeled RNA probes were made 
using the Riboprobe reaction system (Promega, Madison, WI) . 
This system utilizes a vector which is derived from pUC!2 
and contains a bacteriophage SP6 promoter which lies 
5 immediately upstream from an M13 poly linker. First, the 700 
bp £TcoRI fragment was isolated from EA9 and subcloned into 
the polylinker region of the riboprobe vector in both 
orientations (sense and anti-sense) . To generate a template 
for the transcription run-off transcription reactions, the 

10 recombinant plasmids were propagated, purified, and 

linearized with Hindlll. The templates were then incubated 
in a reaction mixture containing the SP6 RNA polymerase, 
triphosphates and 35 S-UTF (as described by the 
manufacturer) • After adding RQ DNase (Promega) , the labeled 

15 RNAs were run over Boehringer pre-packed Sephadex spin 
columns to remove unincorporated triphosphates. 

The slides containing the sectioned seeds were 
hybridized with the radiolabeled sense and anti-sense RNA 
transcripts of EA9 according to the methods of Singer et al . 

20 (Bio techniques (1986) 4.: 230-241) and Taylor and Martineau 

(Plant. Physiol. (1986) 82,: 613-618) • The hybridized slides 
were then treated with nuclear track emulsion NTB-3, 
(Eastman Kodak Company, Kodak Materials for Light Microscope 
Autoradiography, 198 6) sealed in a light-tight box and 

25 exposed for 4 weeks at 5-10 °C. After bringing the slides to 
room temperature they were developed in D-19 developer 
(Eastman Kodak Company) , rinsed, fixed and dehydrated 
through a graded alcohol series. Cover slips were mounted 
with cytoseal (VWR Scientific) . 

3 0 Hybridization of the radiolabeled anti-sense EA9 

riboprobe was seen only in the seed coat tissue of both day 



- 61 - 



14 and 21 seeds. No hybridization of the radiolabeled sense 
EA9 riboprobe was seen in any seed tissues. 

DNA Sequence and Gene Co py Number 

The restriction map and sequence of the EA9 cDNA 
5 clone have been determined (Figure 4) . Identification of a 
polyadenylation signal (Proudfoot and Brovnlee, Nature 
(1976) £62:211-214) and of polyA tails at the 3'-end of EA9 
indicated the orientation of the cDNA clone and the 
direction of transcription of the elRNA* The function of the 

10 encoded protein is unknown at this time* 

EA9 s a member of a small gene family as shown by 
Southern blot analysis, DNA was isolated from B. campestris 
leaves (as described in Example I, Southern analysis), 
digested with either BamHI , Bglll or Hindlll and probed with 

15 a labeled fragment of EA:9 . Three fragments of genomic DNA 
hybridized in both BamHI and Bglll digests. Only 2 bands 
hybridized in the ifindlll digest. The data suggests that the 
EA9 family comprises between one and three genes. 

The sequence of EA9 is used to synthesize a probe 

20 which identifies a unique class of Brassica seed-specific 
genes from a genomic library in the manner described in 
Examples II and III. The regulatory sequences of these 
genes is used to construct an expression cassette similar to 
those described for the napin genes, with the EA9 construct 

25 directing seed coat specific expression of any gene inserted 
in it. 

Example 6 
Other Seed Specific Examples 
Other seed-specific genes also can serve as useful 
3 0 sources of promoters. cDNA clones of cruciferin, the other 
major seed storage protein of B . napus , have been identified 
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(Simon et. al . , (1985) supra) and could be used to screen a 
genomic library for promoters. Without knowing the specific 
functions, yet other cDNA clones can be classified as to 
their level of expression in seed tissues, their timing of 
5 expression (i.e., when postanthesis they are expressed) and 
their approximate representation (copy number) in the B . 
campestris genome. Clones fitting the criteria necessary 
for expressing genes related to fatty acid synthesis or 
other seed functions can be used to screen a genomic library 

10 for genomic clones which contain the 5' and 3 'regulatory 

regions necessary for expression. The non-coding regulatory 
regions can be manipulated to make a tissue-specific 
expression cassette in the general manner described for 
other genes in previous examples. 

15 Example 7 

Construction of "Tomato Ripe Fruit cDNA Bank 
and Screening for Fruit-Specif ic Clones 
Tomato plants (Lycopersicon esculentum cv UC82B) 
were grown under greenhouse conditions. Poly(A) + RNA was 

20 isolated as described by Mansson et al., Afol. Gen. Genet. 
(1985) 200:356-361. The synthesis of cDNA from poly(A) +RNA 
prepared from ripe fruit, cloning into the PstI site of the 
plasmid pUC9 and transformation into an E. coli vector were 
all as described in Mansson et al., Mol* Gen. Genet. (1985) 

25 200:356-361. 

Library .Screening 

Two thousand recombinant clones were screened by 
colony hybridization with radiolabeled cDNA made from tomato 
red fruit mRNA, immature green fruit mRNA, and leaf mRNA. 

3 0 Bacterial colonies immobilized onto GeneScreen Plus filters 
(New England Nuclear) , were denatured in 1.5 M NaCl in 0.5 M 
NaOH, then neutralized in 1.5 M NaCl in 0.5 M Tris-HCl pH 8, 
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and allowed to air dry* Hybridization/ washing and 
autoradiography were all performed as described in Maniatis 
et al., Molecular Cloning: A Laboratory Manual (1982) Cold 
Spring Harbor, New York. 

Sixty-five clones were selected which had more 
intense hybridization signals with fruit cDNA than with leaf 
cDNA and therefore appeared to be under-represented in the 
leaf mRNA population relative to the fruit population • 
Replicate slot blot filters were prepared using purified DNA 
from the selected clones and hybridized with radioactive 
cDNA from leaf, green fruit, and red fruit as before. This 
allowed selection of cDNA clone 2A11, also referred to as 
pCGN1299 which is on at high levels in both the fruit stages 
(red and green) and off in the leaf, 

* Example 8 
Analysis of Clones 
Synthesis of RNA Probes 

The cDNA insert of pCGN1299 was excised as an ZcoRI 
to Hin&IU fragment of approximately 600 bp /as measured on 
an agarose gel) , and subcloned into the Riboprobe vector 
pGEMl (Promega Biotec) , creating pCGN488* 32 P-labeled 
transcripts made from each strand of the pCGN488 insert 
using either SP6 or T7 polymerase were used as probes in 
separate Northern blots containing mRNA from leaf, immature 
green and mature red fruits- The KNA transcript from the 
SP6 promoter did not hybridize to the tomato mRKA* However, 
the transcript from the T7 promoter hybridized to an mRNA of 
approximately 7 00 nt in length from the green fruit and the 
red fruit but not to mRNA from tomato leaf. The direction 
of transcription of the corresponding mRNA was thus 
determined. 



The tissue specificity of the pCGN1299 cDNA was 
demonstrated as follows* RNA from root, stem, leaf, and 
seven stages of fruit development (immature green, mature 
green, breaker, turning, pink, light red, and red) was sized 
5 on formaldehyde/ agarose gels according to the method 
described by Maniatis et al., (1982), immobilized on 
nitrocellulose and hybridized to ^p-labeled SNA which was 
synthesized in vitro from pCGN488 using T7 polymerase. Each 
lane contained 100 ng of polyA* RNA except for two lanes 

10 (pink and light red lanes) which contained 10 fig of total 
RNA. The Northern analysis of mRNA from toot, stem, leaf, 
and various stages of fruit development indicated that 
pCGN1299 cDNA was expressed in all stages of fruit 
development from the early stages immediately after anthesis 

15 to red ripe fruit. No mRNA hybridizing to pCGN1299 was 

found in leaf, stem, or root tissue. The size of the mRNA 
species hybridizing to the pCGN488 probe was approximately 
700 nt. 

Message abundance corresponding to the pCGN1299 cDNA 
20 was determined by comparing the hybridization intensity of a 
known amount of RNA synthesized in vitro from pCGN488 using 
SP6 polymerase to mRNA from red tomato fruit in a Northern 
blot. The ^p-labeled transcript from pCGN488 synthesized in 
vitro using T7 polymerase was used as a probe. The Northern 
25 analysis was compared to standards which indicated that the 
pCGN1299< cDNA represents an abundant mRNA class in tomato 
fruit, being approximately 1% of the message. 
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Example 9 
Sequencing of PCGN1299 and 
pCGN1298 cDNA Clones 

DNA Sequencing 

The polyA* sequence was missing from pCGN1299 cDNA* 
A longer cDNA clone, pCGN1298, therefore was identified by 
its hybridization with the pCGN488 probe. The complete DNA 
sequence of the two cDNA inserts was determined using both 
Maxam-Gilbert and the Sanger dideoxy techniques and is as 
follows. The sequence of pCGN1298 contains additional 
sequences at both the 5' and 3' end compared to pCGN1299. 
As shown in Figure 8 , the sequences are identical over the 
region that the two clones have in common, 
Amino Acid Sequence 

The pCGNl299 cDNA sequence was translated in three 
frames. The longest open reading frame (which starts from 
the first ATG) is indicated- Both pCGN1299 and pCGN1298 
have an open reading frame which encodes a 96 amino acid 
polypeptide (see Figure 8) , The protein has a hydrophobic 
N-terminus which may indicate a leader peptide for protein 
targeting, A hydrophobicity profile was calculated using 
the Hopp and Woods, (Froc. Natl. Acad. Sci. USA (1981) 
75:3824-3828) algorithm. Residues 10-23 have an extremely 
hydrophobic region* A comparison of 2A11 to pea storage 
proteins and other abundant storage proteins is shown in 
Figure 6* The sulfur-rich composite of the fruit-specific 
protein is similar to a pea storage protein which has 
recently been described (see Higgins et al., J. Biol. Chem. 
(1986) 251:11124-11130, for references to the individual 
peptides) . This may indicate a storage role for this fruit- 
specific protein abundant species* 
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Example 10 
Screening Genomic Library 
for Genomic Clones 
Southern Hybridization 

Southern analysis was performed as described by 
Maniatis et al., 1982- Total tomato DNA from cultivar UC82B 
was digested with ScoRI or JEfindlll, separated by agarose gel 
electrophoresis and transferred to nitrocellulose* Southern 
hybridization was performed using a ^-labeled probe 
produced by nic)c translation of pCGN488 (Maniatis et al*, 
1982) • The simple hybridization pattern indicated that the 
gene encoding pCGN1299 cDNA was present in a few or perhaps 
even one copy in a tomato genome. 
Isolation of a Genomic Clone 

A. genomic library established in Charon35/Sau3A 
constructed from DNA of the tomato cultivar VFNT-Cherry was 
screened using the [ 32 P-]-RNA from cDNA clone pCGN488 as a 
probe. A genomic clone containing approximately 12.5 kb of 
sequence from the tomato genome was isolated. The region 
which hybridizes to a pCGN4S8 probe spans an Xbal 
restriction site which was found in the cDNA sequence and 
includes the transcriptional initiation region designated 
2A11. 

Sequence of Genomic Clone 

The DNA sequence of the genomic clone was determined 
by Sanger dideoxy techniques and is as shown in Figure 7. 
The sequence of the genomic clone is identical to the 
pCGN1299 cDNA clone over the region they have in common. 
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Subcloninq 

The region surrounding the Xbal restriction site, 
approximately 2.4 kb in the 5' direction and approximately 
2.1 kb in the 3' direction was subcloned to provide an 
5 expression cassette- The 5' Xhol to Xbal fragment and the, 
3 ' Xbal to EcoRI fragment from the 2A11 genomic clone were 
inserted into a pUC-derived chloromphenicol plasmid 
containing a unique Xhol site and no Xbal site. This 
promoter cassette plasmid is called pCGN1273. 

10 Example 11 

Construction of Fruit- 
Specific Antisense Cassette 
Insertion of Antisense Fragment 

The 2 All genomic fragment was tagged with PG 
15 antisense sequences by insertion of PG into the unique Xbal 
site of the pCGN1273 promoter cassette in the antisense 
orientation. The inserted sequences increased the size of 
the mRNA over the endogenous transcript, and thus the 
expression pattern of the construct could be compared to the 
20 endogenous gene by a single Northern hybridization in a 

manner analogous to the detection of a tuber-specific potato 
gene described by Eckes et al . , Hoi. Gen. Genet. 1986 
205: 14-22. 

Example 12 

25 Insertion of Tagged Genomic Construction 

Into Agrobacterium Binary Vectors 
The tagged genomic construction is excised using the 
flanking Xhol restriction enzyme sites and is cloned into 
the unique Sail site of the binary plasmid pCGN783 (see 
30 Example 2 for construction) containing a plant kanamycin 
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resistance marker between the left and right borders to 
provide plasmid pCGNi269: 

This plasmid binary vector in E. coli C2110 is 
conjugated into A. tumefaciens containing a disarmed Ti- 
5 plasmid capable of transferring the polygalacturonase 

antisense cassette and the kanamycin resistance cassette 
into the plant host genome. 

The Acfrobacterium system which is employed is A. 
tumefaciens PC2760 (G. Ooms et al., Plasmid (1982) 7:15-29; 
10 Hoekema et al., Nature (1983) 303:179-181; European Patent 
Application 84-200239*6, 2424183). 

Example 13 
Transfer of Genomic Construction 
to Tomato via Cocultivation 
15 Substantially sterile tomato cotyledon tissue is 

obtained from seedlings which have been grown at 24 °C, with 
a I6hr/8hr day/night cycle in 100x25 mm petri dishes 
containing Murashige-Skoog salt medium and 0,8% agar (pH 
6.0). Any tomato species may be used, however, here the 

2 0 inbred breeding line was UC82B, available from the 

Department of Vegetable Crops, University of California, 
Davis, CA 95616* The cotyledons are cut into three sections 
and the middle placed onto feeder plates for a 24-hour 
preincubation. The feeder plates are prepared by pipetting 
25 0*5 ml of a tobacco suspension culture (10 6 cells/ml) onto 
0.8% agar medium, containing Murashige minimal organic 
medium (K.C Biologicals) , 2,4-D (0.1 mg/1) , kinetin (1 
mg/1) , thiamine (0.9 mg./l) and potassium acid phosphate 
(200 mg/1, pH 5.5). The feeder plates are prepared two days 

3 0 prior t use. A sterile 3 mm filter paper disk containing 
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feeder medium is placed on top of the tobacco cells after 
the suspension cells are grown for two days. 

Following the preincubation period, the- middle one 
third of the cotyledon sections are placed into a liquid 
5 MG/L broth culture (1-5 ml) of the A. tumefaciens strain. 

The binary plasmid pCGN12 69 is transferred to A. tumBfaciens 
strain 2760 by conjugation or by transformation selecting 
for Gentamicin resistance encoded by the plasmid pCGN1269. 
The cotyledon sections are cocultivated with the bacteria 

10 for 4 8 hrs. on the feeder plates and then transferred to 
regeneration medium containing 500 mg/1 carbenicillin and 
100 mg/1 kanamycin. The regeneration medium is a K.C. 
Biologicals Murashige-Skoog salts medium with, zeatin (2 
mg/1) myo-inositol (100 mg/1), sucrose (20 g/1) , Nitsch 

15 vitamins and containing 0/8% agar (pH 6*0). In 2-3 weeks, 
shoots are observed to develop. When the shoots are 
approximately 1,25 cm, they are excised and transferred to a 
Murashige and Skoog medium containing carbenicillin (500 
mg/1) and kanamycin (50 mg/1) for rooting. Roots develop 

2 0 within 10-12 days. 

Shoots which develop and subsequently root on media 
containing the kanamycin are tested for AFH3'II enzyme. 

An aminoglycoside phosphotransferase enzyme 
<APH3'II) assay is conducted on putative transformed tomato 

25 plants and shoots. APH3'II confers resistance to kanamycin 
and neomycin. APH3'II's activity is assayed (Reiss et al., 
Gene (1984) 30:211-218) employing electrophoretic separation 
of the enzyme from other interfering proteins and detection 
of its enzymatic activity by in situ phosphorylation of 

30 kanamycin. Both kanamycin and [-y~ 32 P] ATP act as substrates 
and are embedded in an agarose gel which is placed on top of 
the polyacrylamide gel containing the proteins. After the 
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enzymatic reaction, the phosphorylated kanamycin is 
transferred to P-81 phosphocellulose ion exchange paper and 
the radiolabeled kanamycin is finally visualized by 
autoradiography . The Reiss et al . , method is modified in 
5 the final washing of the P-81 ion exchange paper by rinsing 
in 0.1 mg/ml of proteinase K. 

gyampje 14 
Construction of Tagged 2A11 Plasmids 
In Binary Vectors 
10 The compete sequence of the 2 All genomic DNA cloned 

into pCGN1273 from the Xhol site (position 1 at the 5' end) 
to the ZcoRI site (position 4654} is shown in Figure 7. 

PCGN12 67 was constructed by deleting from pCGN1273 a 
portion of the plasmid polylinker from the EcoRV site to the 
15 BamHX site* Two DNA sequences were inserted into pCGN1273 
at the unique XJba.1 site (position 2494} . This site is in 
the 3 ' non-coding region of the 2A11 genomic clone before 
the poly A site* 

pCGN1273 was tagged with 360 bp (from base number 1 
2 0 to 3 60) from the 5' region of the tomato polygalacturonase 
(PG) cDNA clone, Fl (Sheehy et al . , Mol. Gen. Genet. (1987) 
205:3 0-3 6) at the unique Xbal restriction enzyme site. The 
tag was inserted in the antisense orientation resulting in 
plasmid pCGN127l and in the sense orientation yielding 
25 plasmid-pCGNl270. Each plasmid was linearized at the unique 
Bglll restriction enzyme site and cloned into the binary 
vector pCGN783 at the unique BamHI restriction enzyme site. 

PCGN1273 was also tagged with a 0.5 kb fragment of 
DNA (base number 1626 to 2115) from a PG genomic clone (see 
30 Figure 8) which spans the 5' end of intron/exon junction. 
This fragment was cloned into the Xbal site resulting in 
plasmid pCGN1215. pCGN1215 was linearized at the unique 



Bglll site and cloned into pCGN783 at the BajztHI site 
resulting in two plasmids, pCGN1219 and pCGN1220, which 
differ only in the orientation of pCGN!215 within pCGN783, 

Three DNA sequences were inserted into pCGN!267 at 
the unique restriction enzyme sites (position 2402, 2406) • 
These sites are in the 3' non-coding region of the 2A11 
genomic clone, 21 bp from the stop codon. The 383 bp Xbal 
fragment from the PG cDNA clone was cloned into the Clal 
site of pCGNl267 after filling in the XJbal and Clal ends 
with Klenow and blunt ligation. The fragment in a sense 
orientation resulted in plasmid pCGN1263 and in the 
antisense orientation gave pCGN1262, pCGN1263 was 
linearized at the unique BgllX site and cloned into pCGN783 
at the BamHI site yielding pCGN1260. pCGN1262 was also 
linearized at the Bglll site and cloned into pCGN783 at the 
JBa^HI site resulting in £wo plasmids, pCGN1255 and pCGN1258, 
which differ only in the orientation of pCGN1262 in the 
binary vector pCGN783* 

The 0.5 kb fragment of the PG genomic clone spanning 
the intron/exon junction (supra) was cloned into pCGNl2 67 at 
the Clal site in an antisense direction yielding plasmid 
pCGN1225. This plasmid was linearized at the Bglll 
restriction enzyme site and cloned in to pCGN733 at the 
BamHl site producing two plasmids, pCGN1227 and pCGN1228, 
which differ only in the orientation of pCGN1225 in the 
binary vector. 

The £Tco7 fragment (base numbers 5545 to 12 , 823) 
(Barker et al . , Plant Mol . Biol. (1983) 2:335-350) from the 
octopine plasmid pTiA6 of A* tumefaciens (Knauf and Nester, 
Plasmid (1982) 8:45*54) was subcloned into pUC19 at the 
EcoRl site resulting in plasmid pCGN7l. A Rsal digest 
allowed a fragment of DNA from bases 8487 to 903 6 of the 
Eool fragment to be subcloned into the vector ml3 BlueScript 
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Minus (Stratagene, Inc.) at the Sinai site resulting in 
plasmid pCGNi278 . This fragment contains the coding region 
of the genetic locus designated tmr which encodes a 
dimethylallyl transferase (isopentenyl transferase) 
(AXiyoshi et al. r Proc. Natl. Acad* Sex. USA (1984) 82:5994- 
5998; Barry et a2., ibid (1984) 82:4776-4780). An 
exonuclease/mung bean treatment (Promega Biotech) produced a 
deletion on the 5' end of the tmr gene to a point 39 base 
pairs 5' of the start codon. The tmr gene from pCGN1272 was 
subcloned into the C2al site of pCGN1267, The tmr gene in 
the sense orientation yielded pCGN1261 and in the antisense 
orientation gave plasmid pCGN1266. pCGN1261 was linearized 
at the Sglll site and cloned into pCGN783 at the BamHI site 
resulting in plasmid pCGNl254. pCGNl266 was also linearized 
at the Bgrlll site and subcloned into pCGN783 at the BamHI 
site yielding two plasmids, pCGN1264 and pCGN1265, which 
differ only in the orientation of pCGH1266 in pCGN783. 
Analysis of Expression in Transgenic Plants 

Immature green fruit (approximately 3*2 cm in 
length) was harvested from two tomato plants cv. UC82B that 
had been transformed with a disarmed Agrobactarium strain 
containing pCGN!264. Transgenic plants are designated 1264- 
1 and 1264-11. The pericarp from two fruits of each plant 
was ground to a powder under liquid N 2 , total RNA extracted 
and polyA* mRNA isolated (as described in Mansson et al., 
Mol. Gen. Genet. (1985) 200:356-361). Young green leaves 
were also harvested from each plant and polyA* mRNA 
isolated. 

Approximately 19 /xg of total RNA from fruit, 7 0 ng 
of polyA* mRNA from fruit and 70 ng of polyA + mRNA from 
leaves from transformed plants 1264-1 and 1264-11 was run on 
a 0.7* agarose formaldehyde Northern gel and blotted onto 



nitrocellulose (Maniatis et al., Molecular Cloning: A 
Laboratory Manual (1982) Cold Spring Harbor, New York), 
Also included on the gel as a negative control .was 
approximately 50 ng of polyA* mRNA from leaf and immature 
5 green fruit of a nontrans formed UC82B plant.. 

As a positive control and to help in quantitating 
mRNA levels, in vitro transcribed RNA from pCGN1272 was 
synthesized using T3 polymerase (Stratagene, Inc.)- 
Nineteen pg and 1.9 pg of this in vitro synthesized RNA were 

10 loaded on the Northern gel* 

The probe for the Northern filter was the 1,0 kb tmr 
insert DNA (a Kpnl to SAcI fragment) from pCGN1272 isolated 
by electroelution from an agarose gel (Maniatis, supra 
(1982)) and labeled by nick translation (Bethesda Research 

15 Laboratory kit) using a 32 P dCTP (Amersham) . 

The Northern filler was prehybridized at 42 °C for 5 
hrs. in the following solution: 25 ml formamide, 12,5 ml 20X 
SSC, 2*5 ml 1 M NaP, 5 ml SOX Denhardts, 0,5 ml 10% SDS, 1 
ml 250 mM EDTA, 1 ml 10 mg/ml ssDNA and 2 ml H 2 0. Then one- 

20 fifth volume of 50% dextran sulfate and approximately 2.2X 
10 7 cpm of the probe was added and hybridization was for 15 
hrs. at 42°C. 

The Northern filter was washed one time in 2X SSC 
and 0.1% SDS at 55° for 2 0 minutes each wash. The filter 

25 was allowed to air dry before being placed with Kodak XAR 
film and an intensifying screen at "70° for two days. 
Northern Results on Transgenic Plants 

The nicked tmr probe hybridized with a mRNA species 
approximately 1.7 kb in length was observed in the total RNA 

3 0 and polyA + mRNA fruit lanes of the Northern blot. This is 
the expected length of the reintroduced 2A11 gene (0.7 kb) 
tagged with the tmr gene (1.0 kb) in the antisense 
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orientation* The level of expression from the reintroduced 
tagged gene is somewhat lover than the level of expression 
of the endogenous 2A11 gene* The level of expression of the 
reintroduced gene in immature green fruit is higher than the 
5 expression level in leaf tissue with a small amount of 
hybridizing mRNA in leaf tissue in these transf ormants. 

Example 15 
Different Sized 2A11 5 < Regions 
The design of the 2A11 cassette is shown in 
10 Figure 9- The cassette contains 3.8 kb of DNA 5' of the 
transcriptional start site and the entire 3' region (from 
the TGA stop codon to a site 2.0 kb 3' of the poly A 
addition site) of the 2A11 gene. Figure 7 shows the 
restriction sites and indicates (below the representation of 
15 the gene) the regions of, the 2A11 gene used to construct the 
2A11 cassette. The 2A11 cassette was constructed as 
follows. 

Transcriptional Initiation Region 

The 5 7 end of the 2A11 cassette was constructed 

2 0 starting with an EcoRX subclone genomic clone as described 
in application PCTUS88/01811 cloned into the ScoRI of 
Bluescript ( + ) (Stratagene) resulting in pCGN1288. This 
clone contains sequences from the EcoRI site at position 
1651 in the intron of the 2 All gene to the EaoRI site 

25 located. 2. 5 Kb upstream of the Xhol site at position 1 of 

the sequenced region (see Figure 7) . The XHOI fragment from 
position 1 of the sequenced region to the XHOI site in the 
Bluescript polylinker was deleted creating plasmid pCGN2004 
which contain the 2A11 region from position 1 to position 

30 1651. The coding region of 2A11 was deleted by treating 

this plasmid with Exonucleaselll/Sl using the commercially 
available Erase-a-Base Kit (Promega Biotec) and sequencing 



deletion plasmids until one was found which had the coding 
region deleted to position 1366 . The resulting plasmid, 
pCGN1251, had the genomic region from the Xhol site 
(position 1) to position 13 66* The EcoRI fragment of 
5 pCGN1288 was then transferred to a chloramphenicol resistant 
plasmid vector, pCGN2 015, to make pCGN1231. pCGN2015 is a 
Cm resistant derivative of the Bluescript plasmid. A 
BstEII/BamHI fragment of pCGN12Sl was then transferred into 
BstEII/BamHI digested pCGN1231 to make pCGN1235 which 
10 contains the region from the ScoRI site (2.5 kb upstream of 
the sequenced region) to position 13 66 of the sequenced 
region flanked by the Bluescript polylinker in a Cm 
resistant vector . 

Transcriptional and Translational Termination Region 
15 The 3' end of the 2A11 cassette was constructed from 

pCGN1273 (described in application PCT/US8801811) by 
digesting the plasmid with Pvul and EcoRI, isolating the 
2249 bp insert (from position 2402 to 4653), ligating with a 
double-stranded oligonucleotide containing the sequence 

2 0 shown in Figure 7 from the BamHI sticky end to a Pvul sticky 

end into a Bluescript vector which had been digested with 
BamHI and EcoRI. The resulting plasmid, pCGN1238, contains 
the 3' end of the 2A11 gene from the stop codon at position 
2381 to the EcoRI site at position 4653. 
25 Final Cons tru c t i on 

-Several versions of the 2A11 cassette in different 
vectors with different flanking restriction sites have been 
constructed; maps of the plasmids are shown in Figure 10. 

A cassette containing the 5 ' and 3 ' regions of the 

3 0 2 All gene was constructed by ligating the BamHI to EcoRI 

insert of pCGN1238 into pCGN1235 which had been digested 
with BamHI and Xbal (the Xbal site having been filled in 
with Klenow polymerase to make a blunt-ended fragment) . The 



resulting plasmid, pCGN124 0, has the 5' end of the 2 All gene 
from the EcoRI site 2.5 kb upstream of the Xhol site 
(position 1) to position 13 66 (which is located between the 
transcriptional initiation site of the 2A11 gene and the 
ATG) , followed by a polylinker region with sites for Smal, 
BamHI, PstI and Sail which can be conveniently used to 
insert genes followed by the 3' region from position 2381 to 
4653. The plasmid backbone of pCGN1240 is the Bluescript Cm 
plasmid described above. 
Construction of Plasmid PCGN1241 

A more convenient version has the EcoRI of pCGN1240 
excised and inserted into a Bluescript vector called 
PCGN123 9 which has an altered polylinker region such that 
the entire cassette can be excised as a Sacl-Kpnl fragment. 
The altered Bluescript vector, pCGN123 9, was constructed by 
modifying the Bluescript polybinder from the Sad site to 
the KpnX site including a synthetic polylinker with the 
following sequence: AGCTCGGTACCGAATTCGAGCTCGGTAC to create 
a polylinker with the following sites: SacI-KpnI-EcoRI Sacl- 
Kpnl. The EcoRI insert of pCGN1240 was inserted into 
PCGN1239 to make pCGN1241 (see Figure 9) . 
Constru ction of PCGN2 610 and PCGN2611 

A chloramphenicol resistant version of the 2A11 
promoter cassette was constructed by inserting the synthetic 
polylinker described above (see construction of pCGN1241) 
into pCGN2 015 to make pCGN1246, followed by insertion of the 
-EcoRI fragment of pCGN1241 to make pCGN2610 and pCGN2 611 
which differ only by the orientation of the inserted 
fragment in the plasmid vector (see Figure 8} . 
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Example 16 
Comparison of Expressio n from Different 
Sized 2A11 5 / Regions 
A beta-glucuronidase (Gus) reporter gene was used to 
5 evaluate the level of expression and tissue specificity of 
various 2A11-Gus constructions* Gus is a useful reporter 
gene in plant systems because it produces a highly stable 
enzyme, there is little or no background (endogenous) enzyme 
activity in plant tissues, and the enzyme is easily assayed 

10 using fluorescent or spectrophotometry substrates. See, 
for example, Jefferson Plant Mol. Biol. Rep. (1988) 5:387- 
405 . Histochemical stains for Gus enzyme activity are also 
available which can be used to analyze the pattern of enzyme 
expression in transgenic plants- Jefferson (1988), supra* 

15 Constructions containing 1.3 kb (short), 1.8 kb 

(intermediate length), or 3.8 kb (long) 2A11 5' sequences 
fused to the Gus reporter gene were prepared. In addition, 
constructions were prepared which have altered 3' ends. The 
altered 3' ends are either a shorter 2A11 3' end from trS of 

20 the T-DNA of the Ti plasmid (Willmitzer et a!., Embo. J. 

(1982) 1:139-146; Willmitzer et al . , Cell (1983) 42:1045- 
1056. The constructions were transferred to a binary vector 
(pCGN1578), and used in A* Tumefaciens cocultivations. The 
resulting binary was used to transform tomato plants. The 

25 transgenic plants obtained were f luorometrically analyzed 
for Gus -enzyme activity. 

Example 17 
Screening Genomic Library for 
Polygalacturonase Genomic Clones 
3 0 Isolation of a Genomic Clone 

An EcoRl partial genomic library established in 
Charon 4 constructed from DNA of a Lycopersicon esculentum 
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cultivar was screened using a probe from the 
polygalacturonase cDNA (Sheehy et al. # Mol. Gen. Genet. 
(1987) 208:30-36). A lambda clone containing an 
approximately 16 kb insert was isolated from the library, of 
5 which an internal 2207 bp HindHI to EcoRl was sequenced. 
The £findIII~.£coRI fragment includes the polygalacturonse 
promoter region. 
Sequence of Genomic Clone 

The DNA sequence of the genomic clone was determined 
10 by Sanger dideoxy techniques and is as shown in Figure 8* 
The sequence of the genomic clone bases 1427 to 1748 are 
homologous to the polygalacturonase cDNA sequence. 

The above results demonstrate the ability to 
identify inducible regulatory sequences in a plant genome, 
15 isolate the sequences and manipulate them. In this way, the 
production of transcription cassettes and expression 
cassettes can be produced which allow for differentiated 
cell production of the desired product. Thus, the phenotype 
of a particular plant part may be modified, without 

2 0 requiring that the regulated product be produced in all 

tissues, which may result in various adverse effects on the 
growth, health, and production capabilities of the plant. 
Particularly, fruit-specific transcription initiation 
capability is provided for modifying the phenotypic 
25 properties of a variety of fruits to enhance properties of 
interest such as processing, organoleptin properties, 
storage, yield, or the like. Further, the results 
demonstrate one can use transcriptional initiation regions 
associated with the transcription of sequences in seeds in 

3 0 conjunction with sequences other than the normal sequence to 

produce endogenous or exogenous proteins or modulate the 
transcription of expression of nucleic acid sequences. In 
this manner, seeds can be used to produce novel products, to 



provide for improved protein compositions, to modify the 
distribution of fatty acid, and the like* 

It is also evident from the above results that not 
only can soybean be transformed, so as to introduce 
5 heterologous genes, but transformed soybean cells may be 
regenerated into plants and the plants demonstrate the 
phenotype of the heterologous gene* In addition, native 
promoters can find use in conjunction with heterologous 
genes and retain their capability to be induced in the same 

10 manner as the native gene. Therefore, one can provide for 
regulated expression of a heterologous gene, where 
regulation may be by an external condition, such as light* 
Furthermore, Ti- or Ri-DNA may be employed for introducing 
the heterologous gene as part of an expression cassette into 

15 the soybean cell without formation of a tumor and the 

resulting cells grown in, culture and plants regenerated from 
the cells* By appropriate choice of various genes, various 
properties of the cell may be enhanced by introduction of 
additional copies of a homologous gene or new phenotypes may 

2 0 be provided by expression of heterologous genes. In 

addition, mutated genes may be employed which can impart 
novel properties to the host cell, providing for host 
resistance to biocides, enhanced production of specific 
metabolites or products at the same or different times from 
25 the normal regulated expressions, or the like. 

All publications and patent applications mentioned 
in this specification are indicative of the level of skill 
of those skilled in the art to which this invention 
pertains. All publications and patent applications are 

3 0 herein incorporated by reference to the same extent as if 

each individual publication or patent application was 
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specifically and individually indicated to be incorporated 
by reference. 

Although the foregoing invention has been described 
in some detail by way of illustration and example for 
purposes of clarity of understanding, it will be obvious 
that certain changes and modifications may be practiced 
within the scope of the appended claims. 



WHAT IS CLAIMED IS: 

1. A DNA construct comprising as operably linked components 
in the direction of transcription, a promoter region obtainable 
from a gene selected from the group consisting of a napin gene, an 
EA9 gene or an acyl carrier protein gene'; a DNA sequence of 
interest other than the native coding sequence of said gene; and a 
transcription termination region, wherein said components are 
functional in a plant cell, and wherein said DNA construct is 
flanked by T-DNA. 

2. The DNA construct according to Claim 1, wherein said DNA 
sequence of interest encodes an enzyme. 

3. The DNA construct according to Claim 1, wherein said DNA 
sequence of interest is an antisense sequence. 

4. A plant cell having an altered phenotype as a result of 
expression of a DNA construct according to Claim 1. 

5. The plant cell according to Claim 4, wherein said DNA 
construct is flanked by T-DNA. a 

6. The plant cell according to Claim 5 wherein said cell is 
one from the group consisting of a soybean cell and a rapeseed 
cell. 

7. The plant cell according to Claim 4, wherein said DNA 
sequence of interest encodes an enzyme* 

8. The plant cell according to Claim 4, wherein said DNA 
sequence of interest is an antisense sequence. 

9. A plant comprising cells comprising a DNA construct 
according to any one of Claims 1-3. 

10. The .plant according to Claim 9, wherein said plant is 
dicotyledonous . 

11. Seed obtained from a plant according to Claim 10. 

12. Seed having a DNA construct according to Claim 1. 

13. The seed according to Claim 12, wherein said seed is an 
oil seed or a grain seed. 

14. The seed according to Claim 12, wherein said seed is 
from a dicotyledonous plant . 

15. The seed according to Claim 14, wherein said seed is 
from a plant of the genus Brassica. 
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16. The seed according to Claim 14, wherein said 
dicotyledonous plant is selected from the group consisting of 
cotton, soybean, saf flower and sunflower. 

17 . A method for obtaining a plant having a modified 
phenotype, said method comprising ; 

transforming a host plant cell with a DNA construct under 
genomic integration conditions, wherein said construct comprises 
as operably linked components in the direction of transcription, a 
promoter region obtainable from a gene, wherein transcription of 
said gene is regulated in plant seed tissue, a DNA sequence of 
interest other than the native coding sequence of said gene, and a 
transcription termination region, wherein said components are 
functional in a plant cell, 

whereby said DNA construct becomes integrated into a genome 

of said plant cell; 

regenerating a plant from said transformed plant cell, and 
growing said plant under conditions whereby said DNA sequence 

of interest is expressed and a plant having said modified 

phenotype is obtained. 

18. A method for altering the phenotype of plant seed tissue 
as distinct from other plant tissue, said method comprising: 

growing a plant, wherein said plant comprises cells 
containing a DNA construct integrated into their genome, said DNA 
construct comprising, in the 5 f to 3 1 direction of transcription, 
a transcriptional initiation region from a gene, wherein 
transcription of said gene is regulated in a plant seed tissue, a 
DNA sequence of interest other than the coding sequence native to 
said transcriptional initiation region, and a transcriptional 
termination region, whereby said DNA sequence of interest is 
transcribed under transcriptional control of said transcriptional 
initiation region and a plant having an altered phenotype is 
obtained. 

19. The method according to Claim 17 or 18, wherein said DNA 
construct is flanked by T-DNA. 

20. The method according to Claim 19, wherein said plant is 
a soybean or rapeseed plant . 
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21. The method according to Claim 17 or 18 wherein said DNA 
sequence of interest encodes an enzyme. 

22. The method according to Claim 17 or 18 wherein said DNA 
sequence of interest is an antisense sequence. 

23. The method according to Claim 17 or 18 wherein said gene 
is transcribed during seed embryogenesis . 

24. The method according to Claim 23 wherein said gene is 
transcribed from about day 7 to day 40 postanthesis . 

25. The method according to Claim 17 or 18 wherein said gene 
is transcribed during seed maturation. 

26. The method according to Claim 25 wherein said gene is 
transcribed from about day 11 to day 30 postanthesis . 

27. The method according to Claim 18, wherein said 
transcriptional initiation region further comprises a 
translational initiation region and said DNA sequence of interest 
is an open reading frame encoding an amino acid sequence. 

28. A method for modifying the genotype of a plant to impart 
a desired characteristic to seed as distinct from other plant 
tissue, said method comprising: 

transf orming under genomic integration conditions, a host 
plant cell with a DNA construct comprising in the 5* to 3 1 
direction of transcription, a transcriptional initiation region 
from a gene, wherein transcription of said gene is regulated in a 
plant seed tissue, a DNA sequence of interest other than the 
native coding sequence of said gene, and a transcriptional 
termination region, whereby said DNA construct becomes integrated 
into the genome of said plant cell; 

regenerating a plant from said transformed host cell; and 
growing said plant to produce seed having a modified 
genotype. 

29. The method according to Claim 28, wherein said DNA 
construct is flanked by T-DNA. 

30. The method according to Claim 28, wherein said plant is 
a Brassica plant. 

31. The method according to Claim 28, wherein said DNA 
sequence of interest encodes an enzyme. 
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32. The method according to Claim 28, wherein said DNA 
sequence of interest is an antisense sequence. 

33. The method according to Claim 28, wherein said plant is 

a soybean plant . 

34. A method for modifying transcription in seed tissue as 
distinct from other plant tissue, said method comprising: 

growing a plant capable of developing seed tissue under 
conditions to produce seed, wherein said plant comprises cells 
containing a DNA construct integrated into their genome, said DNA 
construct comprising, in the 5 1 to 3 " direction of transcription, 
a seed-specific transcriptional initiation region, a DNA sequence 
of interest other than the coding sequence native to said 
transcriptional initiation region, and a transcriptional 
termination region, whereby said DNA sequence of interest is 
transcribed under transcriptional control of said seed-specific 
transcription initiation region. 

35. The method according „ to Claim 34, wherein said DNA 
sequence of interest is an antisense sequence. 

36. The method according to Claim 34, wherein said plant is 
of the genus Brassica.. 

37. The method according to Claim 34, wherein said 
transcriptional initiation region further comprises a 
translational initiation region and said DNA sequence of interest 
is an open reading frame encoding an amino acid sequence. 

38. The method according to Claim 34, wherein said plant is 
a soybean plant . 

39. A method to selectively express a heterologous DNA 
sequence of interest in seed tissue as distinct from other plant 
tissue, said method comprising: 

growing a plant capable of developing seed tissue under 
conditions to produce seed, wherein said plant comprises cells 
having a genomically integrated DNA construct comprising, as 
operably linked components in the 5 ' to 3' direction of 
transcription, a seed-specific transcriptional initiation region 
and a translational initiation region, a DNA sequence of interest 
other than the coding sequence native to said transcriptional 
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initiation region, a transcriptional termination region downstream 
of said DNA sequence of interest, whereby said DNA sequence of 
interest is expressed under control of said seed-specific 
transcriptional and translational initiation region. 

40. The method according to Claim 39, wherein said plant is 
of the genus Brassica. 

41. The method according to Claim 39, wherein said plant is 
a soybean plant . 
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CGNE 99-2 



TITLE OF THE INVENTION 



METHODS AND COMPOSITIONS FOR 
TRANSCRIPTION AND EXPRESSION OF HETEROLOGOUS GENES 

Abstract: of the Disclosure 

Regulatory regions from genes expressed during a 
particular developmental stage or in a specific tissue are 
identified employing cDNA screening* Tlie resulting 
regulatory regions are manipulated for use with foreign 
sequences for introduction into plant cells to provide 
transformed plants having phenotypic property which can be 
modulated. The invention is exemplified with light , seed 
and a fruit-specific promoters. 



87 



vo 



£-1 O VO 



00 

m 



o 

M 

< 



H 

H 2 

M <H 



I 





M 




u 




rC J— ( 


— U in 




q o 




— <5 in 





ST 
IS- 



> o 
u 

s 

1 

a 

a 

i: 
I 

o 

S 
8 



a 



H 

8- 



a 

I 

8 

Eh 



O 

in 



h a 
< Q — h 



m 

id 
CO 



o 
m 



in 
o 



o 

H 
fa 



o 

i 

a 
a 
e-* 
a 
u 

a 

u 

9 

I 

CM 



CO 
GO 



CN 
ID 



CN 



o 
o 



1 



GO 
, O 



HI <, 

a em 



O O 

l cr* ^3* ^ 
co ^l* ^ 



s s 

1 

I s 

8 E 



H 



H 

O 









-a c- 












o 




CD 




a 

Q 




ATA 








a 




u 












s 




o 




o 




u 
















1 


H 






U CM 


r-l 


O 




— o r* 







! 
if 



r-t 



CO 
CO 



00 



8^- 



q 

1 

o 

g 
g 

8 

a 
o 

E- 

u 

a 

o 

8 

a 

O 



m 
m 
o 



o 




U 



O 
H 
fa 



CM 



o 

00 

n 



CO 



M 

c 

X* 




p 



H 
fa 



in 

CM 





o 
5* 




<; 
ft 




o 


M 




(S 




W 


CJ in 


a: 








g 








o 
























8 








CJ 












§ 












l 












q 



I 
I 

o 
o 

e- 
e* 
o 

% 

o 
o 
o 

a 
a 

r- 
in 













C- 




t-l 








o o 


C 






rtl r-* co 




— < r — 1 00 








— a r-» 












i 







CO 



m 
cn 

O 

o 
o 

% 

s 



CD 
H 
fa 



H 

■a 



o 
o 

CM O 

o 
< © 



w — u cm 

s 

i 

s 

Si 

s 



H 



1 

h q 

o 
o 

o 
o 

! 

u 













o 




CM 


H 








s 


< O 




— qH CN 




<< 




y 
















r \ 




















rn 








r ] 








r i 




£-* 




o 




u 








£-* 


H 








<-< 


e-. o 




— a cm 




S3 




< 







M 
3 



fa 

H 



H 
fa 



-CJ 



o 



LU 
CM 

d 

LL 



o 25 



g fe s s g £ 5 ^ 5 



D g £ o _ £ £ £S 

S £s S S 8 - g 

S 55= 5 S < o G 



O 7^ OO — O CM 



<_? CU 



o _ z: jx ^ 



^ 2 G o 13 £ ^ 0 rif2 



o 
o 

I— 



*e L_ O 7-; <o <c — O — I CM 

5 S ^ g - G cm g 



H- O 



CO 



CM g 2 S CM CM 

















O 






























H- 






O 






>— 












>— 








CM 




>— 






w 


O 




CM 










<C 










cu 




0 


-0 


< 


0 






CM 


































i 











CM 

ci 



CM 
CM 



— — <a ocurN o 

CO »CJ >O4CQC0O 

TOOL CO UDOCM 

ac <n cx — oo cmcncm 
acz; — -o cc cso* 

o««c 
-< o 
o k» 

OCX. 

*— e 
«c en 



ux: 

o « 
o— 
o«< 

« 
o*c 

o-«c 

OO . 
— . — O • 

co— <j w o««ccNidr>«s3rcM 
l— cc c<-w-^»^w w 
— *-<o CNCN 





<3 




— 








o— 




H-C3 




<J> 








1— CO 




0> 




o w 




U£ 




<-*:>— 
o o 




O 4- 








O UO 


o 




u 


►—»■■■ CM 






CO 




o> 




o u. 




O 49 




<n 




O CO 
























o 




CM 




CM 




oo 

-sr 
cm 

— 0<0 — • 

cj 00 

CO— 0=04* 

— h-oj^t 

<— O^CN 

-C — 

OO 

o c 



O to 

O 



O cn 
O >» 



O cs 

OO 
OfiS 

oo 

O 2 

OO 
O 3 



— o o 

— O L 

cc o w. ~ 
rs— ocucni 
o as 
o w 

oc 

oo 
o cc 



O Ok 
O 

o>> 



q,CO o— oo 
« Q oomro 

aire— *o t— cnoj 
o <u 

oe 



-co 

ot- 

UO.H 

^CUQO 



OO 
O w 



CN 
CM 



— <o OOfO 

— a o o^xo 

»o wcsim 

CD OO. 

— o >* cm 
o— 
oo 

O G 

oo 
oo 



oo 
o c 

<n 

03 
OO 

o— 



oo 



w ooo 

<n Of 



o> 
o 

CM 



oo 

in 

00 
CM 



o 

CM 

d 

IT 




o 



vo 



in 



0) 
U 

0) 



CO 

o 

a 
o 
c 

<D 
O 

Pi 

3d 

to 

-u 
to 

& 



<0 

o 

CO 
CO 

n 



u 
o 



-u ■ 



1 



M 

a) 



S-4 



M 

w 
PS • 




H 

H H 

h 2 e 
a ^ — a m oo 



■I 
I 

a 
a 

i 

I 

3 

U 

i 

o 
o 



CM 



o 
m 



0 

H 
fa 



IT) 
LO 



CM 



o 





8 


H H 




M M M 




Pi nJ m 






O ro O o 




— CJ o ^ ^ 




— O ^ 








a 




ATTC 




ATTA 




Eh 



8 

I 

a 
eh 
a 

&-* 

i: 

a 

i: 

Eh 

Eh 
O 
^< 

a 
a 
C3 



s 

&H 

Eh 

Eh 

I 



a 
a 
o 

O 

< 

a 
a 

Eh 
Eh 

I 

a 

tn 



T-t &H 
tH U 
rH &H 

*— S: 

a 
a 

I 

Eh 
Eh 

a 

o 
a 

a 

£h 

Eh 

e 

Eh 



u 
o 

s 

Eh 

g 

I 
8 

EH 

o 

CO 



IT) 



H fH 

«3 id 
co as 



H 

i< H 

ro H 

id tn 

CO CQ- 



Eh 

8 

Eh 
Eh 
Eh 

8 

E-> 
O 



Eh 
&H 

£h ro m 

U <3\ <Ti 

-i< lo in 

Eh 

8 

O 

o 

CD 
Eh 

u 

& 

P 

o 
u 

§ 

o 

Eh 

u 

Eh ^ ^ 

< VO <JD 

-O in in 

a 

U 

a 
a 

in 
tn 




a 

8 

Eh 
Eh 
Eh 
Eh 

| 

a 

8 

O 

a 
S3 

Eh 






£h 




Eh 




E-* 












a 


H 


Eh 


H 




H 




<P 


a ^ 


05 


U m 


X 


— a 




a 








8 




a 




Eh 






H 






o 


a 


Eh m 


•l-f 


O CM 


X 


— < 




o 




CM 




CM 







0 
H 
Ed 



m 



oo 



u 



Eh 
Eh 

I 



in 













£h 

i 








£h 






M 


s 




u 


C! 


£H M 


M -H 


H o 




— i< r- en 


<d 




Eh 


O 

i: 




8- 








a 




E-» 




M 








VO 



H 3 Eh cn 

a < — a oo r- 



H H 
4J (C 



o 

u 
u 

s 

q 
IS 

t 

U 
Eh 

a 

q 



CO 



H 
M 
0) 



Eh 

^ H H 

U VO VD 

o 
a 
< 





























£h 


M 


a 


H 




H 




<U 


O ao 


«S 






O t> 




o 




q 



H <tj 

MM £h 

3 0) Eh CO CO 

.D (tf O VO VO 

c/} tc — O i> 
O 

a 



Eh 
Eh 

O 

vo 



f: 

I 
I 

E-» 
C3 




H 




M 




M H 








W (0 H 


a 




— a 


U 


O cn 






CQ 






O 




u 
















a 




Eh 



CM CM 

cn cn 



H 




3 




M 


£h cm 


< 


— U cn 
























8 

■s 








TTG 




GTT 




a 












U 












a 



o 



CD 

H 
fa 



CTl 
CM 
CO 



t— ! 
< 

m 

H 3 



o 

00 

m 



< 

i 

O 
E- 

^ UD CO 



M 
< 

cn H 

to cq — g 



H 
3 



a 

O m m 

En H H 
< fO 



3 



2 



H 
(G 

X! 
53- 



CO 
ID 

3 

O 
— E- 

a 

£h 

a 
e 

E-» 
CD 

1 

a 
o 
-a 
a 
o 
u 

a 
o 
eh 
u 



CO 



3 



00 

(d in 

rH <H 

tn 
>i 
u 
u 

Q> 
CO 

a) 

H 
W 

a) 

CO 



£h 

a 

o 

E-* 

u 
u 
a 
u 

< HI 

< u 

U 0) 

. Eh CO 

Eh C» 

O " 
U 

a 
o 
u 
u 

u 
a 



o 



Eh 
Eh 

CJ 
H» 



0> 
C 

u 
a) 

CO 



CN 



00 
CO 



in 



LO 
rH 



IT) 




<T\ 



o 
e« 
o 

% 

I 

o 

Sc 

o 
Eh 

$3 

I 

s 



a 

El 



M 

ffi < 

Q 



o 
a 
< 



j 

m 

cn tn 
ar- 
c/2 t-H 

rH 



cn — o 
Eh 

u 
a 

H 

Eh 

<3 



3 



u 
u 
< 

a o 

O rH 



»4 m 
&^ 

rH 

a. 
w 

3? 

u 



oo 



E^ rH 

eg 3 
o o 



CO C5 

% 

a 
a 



£h 

H < 

a) o 

■ Eh 

Q 



O > CN 



2 



CO 



oo 



CO 



CPH H CJ 
CO — * 



< CM O CM 
O *~f CM i~4 
•O CJ H H 

•O CM 



CD 



3 <J 3t©H 



w CO 

o Po 



O 
H 
fa 



W CM 
M CM 

^ MHO 

C < — O cm tn 

a ^ 

X < CM 



o 

CM 



i 

Q 
U 

O 

-H 

-H 
O 
<l) 

cn 

<l) 

w 

4J 
01 
0) 



s 



< 

*0 




0 
53 ■ 



o 



S 



Si 

S o 

-O V4 

v a 



3, 



o 



E 

x: 

cn ~ 
.5 o 

® s 

w QJ 
£ 0) 
§.£ 

o 

^ 00 

ll 

£ £ 

to 

a> jo 

< < 

z 

u CL 

'B 6 
« -o 

Q O 

q. a 

CD ~ 

° 8 

ceil o 
^_ c 

° E 

a? 

§-« 

CD 

a) £ 
■o £ 



CD 

c 

CD 

c 

CO 
€0 

o 0 
15 H 

CD 
CO 
O 

a. 

!s 

c 

O 

Ql 



O >» <0 
<D -Q 



c 2 « 
©eg 



O .22 



3 

cr 

<D 



in 
m 



rH 



m 

00 



CM 

uo 



rH ro 



Pi 



CO 



&h ,c 




(< rH 








u o 
















< rH 




u < 




o o 




a a 




Eh s 




CD ^ 








Eh 




Eh CO 








a o 








Eh 0) 




M Eh O 




r J 




< F bl 




&h M 




m 0 0) 




a a> 




2 ^ CO CO 




^ CO 




id < >i^> 




Eh 0) 




CO O rH ^ 




Eh rH 




u o 




< M 








CJ 2 




p ^ 




&n 0) 








a j 




<; s 








£h <D 








Eh ^ 








a >i 




a > 




a <h 




O rH 


1*4 






q a 


h < a 








< < rH 








m a a 




< ^ 




3 Eh acM 








crj < win 




< >t 




co — a <c ^ 








a a 
> 




<! a) 

fH rH 


H 






< H 


Elj 


o <u 




o c 


Eh rH 




<J rH 












Eh O 








(J c-l 




O rH 




CJ Oi 




O <J 








<jj rH 




o £Z 




E-* <tf 




< £r* 




O > 




O rH 




< >l 








O rH 




3 O > tn 




q a 




rH Eh rtj m 








< CJ r-i <Tj» 




<q rH 




q < 




o o - 








< a> 




O rH 




«-» 




q o 




< H 




ft to 




E-» 03 




< >i 




a ^ — i 




< j 




a < 








o <3 








CJ rH 








O < 




O Di 


H 






a ^ 


H M 


fP fTi 




<c < 


? 2 


O > cn <r\ 




a a 


p> rH 


E-t aS oo co 






& < 


O rH ^ ^| 
















< u 




< CO 




u a; 




o < 




E-« CO 




Eh rH 




fH <U 





rH 



oo 



vo 



o 
cn 
vo 



00 
00 



CO 




M 
M 

8- 



vo 




o 




MM E-* 

m 3 e 
>h E-* 

* ~% 

M E-» 

«J o 
Qi O 

— a 



CO 

o a\ 
m r- 
o 





a 














E* 




i 






< 








Phr 


M 


a 


O 

^ t— f 


aj 


a 


P4 ^ 


w 


a 


^ O 


oi 


E-* 






a 


u 






0) 












m 



1 
I 

8 

C3 
E^ 

U 

Eh 

1 

Eh 

a 

o 



a 



H 
fa 



00 



3H11 TTTTTTTGAGCAAAGGGCAACTCAGATATCCAAAGATGAATCCAACATATA 51 



3H11 GCTTACAGCTGGGAGAACATTGTCTAACTCTTCTGAAATTTAAATGTTATC 102 



3H11 CAGAATCCTTCATCATAAAATAATATCAAAATGCAAATCTATTTTTTCTAC 153 



3H11 TCTTGTCTAGCTTCAACTTTCTTCTTCTGCTCATCAATTAGCAATTAATCC 204 

TGCTCATCAATTAGCAATTAATCC 



3H11 AAAACCATTATGGCTGCCAAAAATTCAGAGATGAAGTTTGCTATCTTCTTC 255 
2A1 1 AAAACCATTATGGCTGCC AAAAATTCAGAGATGAAGTTTGCTATrTTrTTr 

METAlaAlaLysAsnSerGluMETLysPheAlallePhePhe 



3H11 GTTGTTCTTTTGACGACCACTTTAGTTGATATGTCTGGAATTTCGAAAATG 306 
2A1 1 GTTGTTCTTTTGACGACCACTTTAGTTGATATGTCTGGAATTTCGAAAATG 
ValValLeuLeuThrThrThrLeuValAspMETSerGlylleSerLysMET 



3H11 CAAGTGATGGCTCTTCGAGACATACCCCCACAAGAAACATTGCTGAAAATG 357 
2A1 1 CAAGTGATGGCTCTTCGAGACATACCCCCACAAGAAACATTGCTGAAAATG 
GlnValMETAlaLeuArgAspIleProProGlnGluThrLeuLeuLysMET 



3H11 AAGCTACTTCCCACAAATATTTTGGGACTTTGTAACGAACCTTGCAGCTCA 408 
2A1 1 AAGCTACTTCCCACAAATATTTTGGGACTTTGTAACGAACCTTGCAGCTCA 
LysLeuLeuProThrAsnlleLeuGlyLeuCysAsnGluProCysSerSer 



3H11 AACTCTGATTGCATCGGAATTACCCTTTGCCAATTTTGTAAGGAGAAGACG 459 
2A1 1 AACTCTGATTGCATCGGAATTACCCTTTGCCAATTTTGTAAGGAGAAGACG 
AsnSerAspCysIleGlylleThrLeuCysGlnPheCysLysGluLysThr 



3H11 GACCAGTATGGTTTAACATACCGTACATGCAACCTGTTGCCTTGAACAATA 510 
2A1 1 GACCAGTATGGTTTAACATACCGTACATGCAACCTGTTGCCTTGAACAATA 
AspGlnTyr^lyLeuThrTyrArgThrCysAsnLeuLeuPro . 



FIGURE 5A 



3H11 TCAATGATCTATCGATCGATCTATCTATCTATTTATCTGTCTCTGCGCGTA 561 
2A1 1 TCAATGATCTATCGATCGATCTATCTATCTATTTATCTGTCTCTGCGCGTA 



3H11 TAGTGTTGTCTGTACCTTTGGTGTGAAGAATATGAATAAAGGGATACATAT 612 
2A1 1 TAGTGTTGTCTGTACCTTTGGTGTGAAGAATATGAATAAAGGGATACATAT 



3H11 ATCTAGATATATTCTAGGTAATGTCCTATTGTATTTAAAATTTGTAGCAAT 663 
2A1 1 AIC1AGATATATTCTAGGTAATGTCCTATTGTATTTAAAATTTGTAGCAAT 



3 H 1 1 GATTGTTTGAATAAAAACATACC ATGAGTGAAATAATTATTCCAC ATTAAT 714 
2A1 1 GATTGTTTGAATAAAAACATACCATGAGTGAAATAATTATTCC 



3H11 TCACGTATTTATTTCACTTATGATACGTATTTTTGTTCCTTTCGCGTAAAA 765 
3H11 AAAAAAAAA 774 



FIGURE 5B 



2A11 

PAIb 

Chicic pea 
la hi bi cor 

Lima bean 
inhibitor 

c<j -antitrypsin 



® M A L R 
®C SPF 
©C T-K 



L C T-KS 



LG AIPMS 



IPP 



IP PI 



IPPQETLL 
DftPPpGSPLCRCI 
S|IPP QCRCN 

QCRCT 

E V 



GSFL CB 



2A1I 
PA lb 

Barley chloroform/ 
methanal-solabie * TNLjL 
protein d 



tniE]gl£*epcssksd 



GN 



/ 



Wheaic^-araylasc V S A 
iniiibicor GJZS 



Wheat albumin 



CIPAGL YIGNjctR 



LTG 



Millet bi-^actiaaai KNPpL[D Si 
inhibitor 



Castor bcaa 2S 
small iubunic 



CR-FYL VQQxjc; 

v. 



QQNLRQC 



Napia small jubunic A Q NjLjRA C^QQWLNKQAMQS 



• AM V£LQ 



VP A IJP ACRPL-LRLQ- 



IE 



K 



CO 

QEYISQQ VSGQ 



FIGURE 6 



2 All GENOMIC 








1 A 

10 


o a 


30 


40 


50 


CTCGAGCCCT 


TTAAAAAGTA 


TAGTCAATAT 


TTACGGTGAC 


CGTGAATTTC 


60 


70 


80 


90 


100 


TTAATTATGA 


TATATAATTT 


AAAAGAAATC 


ATGATCACAT 


TCTACTGATG 


110 


120 


130 


140 


150 


AGAACATGTG 


CTAATCAAGG 


GAAAACATGG 


ATGTGAAAAA 


TACTTTTTGT 


T CA 


1 7A 

X f v 


180 


190 


200 




A A a a A ATfSTf"^ 


AAA 1 XX i I 1 


AOT T ATTTAC 


TACCTATACA 


JL X V 


9 7 n 






250 






I JL 1ALAAA1A 


IL i AA I AbAA 


GATTTTCACC 




270 


Z <3 U 


^ yo 


300 




TATGTAAATT 
x t\ x vj x rvrui x x 


AA I 1 A x AA X \3 


A AT arTPTP A 
AALAL X Vw x L. A 


CATAAAATAA 


310 

J X \J 


320 
^ ^ 




u n 

J *t u 


*5 C A 

350 


TTATCAGTAT 


ATACATTAAT 




TAf AATP.AAT 


1 AAATAAAAT 


360 
—j v w 


370 


^ fin 

J O w 




a n a 
400 


GTAftAACATG 
x AUAn^ r\ x w 


ATPTArAPTT 


r 1 A AT A A A ATT 


A ACAPPATA A 


AbAATAATTT 


din 

*» X u 


*i * V 


d in 

4 J u 


4 4 n 
^ ^ u 


450 


a a a i^iTsr 
LMAft 1 A 1 AU 


ALA ltjl LAAt 


HATTV A ATTB.T 

AA1 AAA I X Al 


X ivjCAl Al IA 


TATTAACTTA 


j< c a 

4 OU 




4 80 


4 90 


500 


CTAAACAATC 


TTTACTTTTG 


AAATATAAAA 


ATAATCAAGT 


TATAAGTCTG 


510 


520 


530 


540 


550 


CTCAAAGTAA 


AGCACTTGTT 


AGACTCATCT 


GATTTTGAGA 


AGGTAAGCAA 


560 


570 


580 


590 


600 


ATTGATGGTG 


CATAATAGXC 


ACAAGTAAAA 


TATAAAATAG 


ATTTCATTAG 


£ i n 




630 


640 


650 


X iT-TLrt-fti A O X 1 


Ixx x AL 1 1 1L 


TTTATATATA 


ATTATCAATA 


TCCTTCAATG 




67fl 
D / U 


C Q A 


^ (i a 
6 90 


700 


vj X i\\3\3 X x nni 


TATATTCTTA 


AC liL-llljil 


bAA i I AAAGG 


AATAAGACAA 


71 n 


7?n 


/ JU 


*7 A A 


750 


r*S A AT ATTA A 21 
yjr*U\ i nil AAA 


fiATAAAA^AA 
^ A X nnnnu AA 


LAA I AAAAA i 


A f 1 A Sin /-*m * 

AvjAAAtjACTA 


AGAGATAAGA 


7 fin 


77fi 


i q n 


*? rt rt 

790 


800 


i 1 iLl i AA 




AXAACj iaila 


ff * /~»m /*»m m 

T C AAGT GTAT 


ACAATATAAA 






8 30 


840 


850 




mmm /-» ■» m^m m m 

TTTGATCTAT 


CTATTTATAA 


TGTTATATAT 


AAGCATACAA 


a c A 


o / U 


a o rt 

880 


890 


900 


AAGATCAGTC 


ATAAATATGA 


CTTTAlATCAT 


GAAAATAATG 


AAAGAGATTA 


910 


O 1 A 

920 


930 


940 


950 


TGAAGGCGTA 


AGGTTACTAG 


AATAATAGTC 


ATTAAAAAAA 


GGGGTTATCT 


960 


970 


980 


990 


1000 


TTATAATTGA 


ATAATTGATG 


AAGTAATGGA 


GATAATTAGT < 


GAGCATAAAT 


1010 


1020 


1030 


1040 


1050 


TTTTTTAAAA 


AAATGGACAT 


TTACACTATA 


ATATTTTATA . 


ACACTTTCCC 


1060 


1070 


1080 


1090 


1100 


TTAAACATCT 


AGGTATAAAT 


AATGAGTCTT 


GTCAAAATCT TAGTAGGAAA 



FIGURE 7A 



1110 1120 1130 1140 1150 

AATTCTGTGA AATTTTTTTA GTGAAAACAA ATGATATAAA TATCTTGAAT 
1160 1170 1180 1190 1200 

ACTCATTATT TGTTGTCTCA TTAAAAATCT TATCTGACCT ATAAAATAAA 
1210 1220 1230 1240 1250 

TTATTTGCTC AACTCAAAAT AGTTTTTCAT TCTAAAATTA GTATAATTAT 
1260 1270 1280 1290- 1300 

TAGTGAATAT TTAATTAACA TAATTGTATA CTAAGGGGCC TATAAATTGG 
1310 1320 1330 1340 1350 

ATTCTTCTCA AAGAAAAATA AAATCACCAC ACAACTTTCT TCTTCTGCTC 

1360 1370 1381 -1390 

ATCAATTAGC AATTAATCCA AAACCATT ATG GCT GCC AAA AAT 

MET Ala Ala Lys Asn 
1399 1408 1417 1426 

TCA GAG ATG AAG TTT GCT ATC TTC TTC GTT GTT CTT TTG 
Ser Glu MET Lys Phe Ala He Phe Phe Val Val Leu Leu 
1435 1444 1454 1464 1474 

ACG ACC ACT TTA GGTTCACAAC ACTTCTCCCT TATTTTGTTT 
Thr Thr Thr Leu 



1484 

J» T W T 


1494 


1504 


1514 


1 S?4 


Tf TT A ATTTf"' 
i ^— x xruii x x ^ 




TATGCATGTG 


TTTHGTATTA 










J. JoH 




A iAAAWj AAA 


AiAl ill 1 


lAAi iAUlvjia 


TTTTPT 8. 1 TV* 


iTTGGTAGGT 


1584 


1594 


1604 


1614 


1624 


AATCGGAAAT 


TATTATGAGA 


TAATGAACTT 


GCAAAGTCAT 


TATTATATAA 


1634 


1644 


1654 


1664 


1674 


CTTTTTTTTT 


ATACTTTGAT 


TTAAGAATTC 


ATTTTTCTCA 


TTTTATATAA 


1684 


1694 


1704 


1714 


1724 


ACTTATTTTT 


CAACAGAAAA 


TATTTTTCGA 


ACTATTCAAA 


CACACCCTAA 


1734 


1744 


1754 


1764 


1774 


GACATTACAT 


ATATATATAT 


ATACACCCTC 


CGTTTTATAT 


TACTTAATGC 


1784 


1794 


1804 


1814 


1824 


CTATTGAGTT 


GGCCCACCCT 


TTAAGAATGA 


TTCAATTAGA 


GATATGTTTT 


1834 


1844 


1854 


1864 


1874 


ACTAAATTAA 


CCTATGCTTT 


AAGACTCTAA 


ATTTGGCTAT 


TACTATTTTA 


1884 


1894 


1904 


1914 


1924 


CGTTGTAATT 


TAATGACAAA 


CATTTCATAA 


TGACTATAGT 


CTGAACTTAA 


1934 


1944 


1954 


1964 


1974 


TTAGACAGAC 


GTATCTATAG 


TTTGCTTACT 


AATGATTCAT 


AGCTATATAT 


1984 


1994 


2004 


2014 


2024 


TTGGAGAGGA 


GAGAGACAAA 


CGATATTAAG 


AAAGGGAGGA 


GAGAGGCGAG 


2034 


2044 


2054 


2064 


2074 


GTAAATCTGA 


AATAGAGAAG 


AGAAAGGCAA 


CCAATTTTGA 


TCATCTATCA 


2084 


2094 


2104 


2114 


2124 


TACTTTTGAT 


TATTATTTTT 


ATTATATGTA 


CGTTTACATT 


ACAGTTTTCG 



FIGURE 7B 



2134 2144 2154 2164 

AATTCTTACA TTAATCTTAA TCATAATATA TACA GTT GAT ATG 

Val Asp MET 
2173 2182 2191 2200 

TCT GGA ATT TCG AAA ATG CAA GTG ATG GCT CTT CGA GAC 
Ser Gly lie Ser Lys MET Gin Val MET Ala Leu Arg Asp 
2209 2218 2227 2236 2245 

ATA CCC CCA CAA GAA ACA TTG CTG AAA ATG AAG CTA CTT 
lie Pro Pro Gin Glu Thr Leu Leu Lys MET Lys Leu Leu 

2254 2263 2272 2281 

CCC ACA AAT ATT TTG GGA CTT TGT AAC GAA CCT TGC AGC 
Pro Thr Asn He Leu Gly Leu Cys Asn Glu Pro Cys Ser 

2290 2299 2308 2317 

TCA AAC TCT GAT TGC ATC GGA ATT ACC CTT TGC CAA TTT 
Ser Asn Ser Asp Cys lie Gly He Thr Leu Cys Gin Phe 
2326 2335 2344 2353 2362 

TGT AAG GAG AAG ACG GAC GAG TAT GGT TTA ACA TAC CGT 
Cys Lys Glu Lys Thr Asp Gin Tyr Gly Leu Thr Tyr Arg 

2371 2380 2393 2403 

ACA TGC AAC CTG TTG CCT TGA ACAATATCAA TGATCTATCG 



Thr Cys Asn Leu Leu Pro 



2413 


2423 


2433 


2443 


2453 


ATCGATCTAT 


CTATCTATTT 


ATCTGTCTCT 


GCGCGTATAG 


TGTTGTCTGT 


2463 


2473 


2483 


2493 


2503 


ACCTTTGGTG 


TGAAGAATAT 


GAATAAAGGG 


ATACATATAT 


CTAGATATAT 


2513 


2523 


2533 


2543 


2553 


TCTAGGTAAT 


GTCCTATTGT 


ATTTAAAATT 


TGTAGCAATG 


ATTGTTTGAA 


2563 


2573 


2583 


2593 


2603 


TAAAAACATA 


CCATGAGTGA 


AATAATTATT 


CCACATTAAT 


TCACGTATTT 


2613 


2623 


2633 


2643 


2653 


ATTTCACTTA 


TGATACGTAT 


TTTTGTTCCT TTCGCGTAGA TTTTTGATCC 


2663 


2673 


2683 


2693 


2703 


TTTTCCCTTT 


TGAATATTAA 


ACATTAAACA 


CAAATAATGT 


TTATTAAATT 


2713 


2723 


2733 


2743 


2753 


AAGTTAATAT 


TTTTATTTAG 


CTATTTATAT 


TTTTATTTGA AATCAAACTT 


2763 


2773 


2783 


2793 


2803 


GATAAATATT 


TATAAAGATA 


ATTAACAAGT 


AATGTGACAC 


TAACACCATG 


2813 


2823 


2833 


2843 


2853 


TAATATTATC 


TTGTCGTTAT 


TTATGATAAT 


ATTTTAAAAT 


TATAATTTCA 


2863 


2873 


2883 


2893 


2903 


GTTAAAAAAT 


TATTAAAAAA 


ACATACTTTT 


AAAAAGTGAG 


TTAGCCTCCG 


2913 


2923 


2933 


2943 


2953 


CTACCCACAT 


ACTTATGAAT 


TGGACTAGTT 


GTTTTTTGAC 


CCACAAAAAG 


2963 


2973 


2983 


2993 


3003 



AATGGGCTAA TTAAACCTGA CCTATCAAAT TTCAGAATCT GCATAGATTA 



FIGURE 7C 



3013 


3023 


3033 


3043 


3053 


GTCCGAACGA 


AATGAGTCAG 


CCCGTATTGA 


ACAAAATATC 


AACAAGGACG 


3063 


3073 


3083 


3093 


3103 


TTATGTAAAG 


ATGTTTAAGA 


AGGAAAAAAG 


ATTTCTAATA 


CATATGGACT 


3113 


3123 


3133 


3143 


3153 


TTCAATATCC 


CAACTTTGTC 


TGGCGATCTG 


AACCCTGCTT 


AGTTTGTTGA 


3163 


3173 


3183 


3193 


3203 


TCATTAACTT 


GTCTTGCTAT 


GTATTTAAGA 


TTTAAACTTT 


ATATGTTTAA 


3213 


3223 


3233 


3243 


3253 


ACTTACAGAA 


AATACATATA 


AATCTCTCAA 


GACTTGGCAA 


CATAATTTAC 


3263 


3273 


3283 


3293 


3303 


TTTAGTACTT 


AAACTACATG 


AAAATTTAAA 


TATCCTTTTA 


ACATCTTTGA 


3313 


3323 


3333 


3343 


3353 


AGTGAATTAA 


ATTATCACAA 


TCCGAGCCTA 


CACCTTGGAC 


GTGGCCGGCA 


3363 


3373 


3383 


3393 


3403 


CTCAAGAACC 


AGTGCTGGTC 


CCCAAGCTAA 


CCCTCATCCT 


GACTGACTAC 


3413 


3423 


3433 


3443 


3453 


AAGCGGAAGG 


CTAACTTAAG 


TATACAAAAG 


CTTAAAACTG 


AATAAAATAA 


3463 


3473 


3483 


3493 


3503 


ACTTTACAAG 


GTTTTAACAC 


AAATGAACAA 


CTTTGAAGAA 


AATAATATAT 


3513 


3523 


3533 


3543 


3553 


TCAACTAGCC 


ataaaatXga 


CAACTTTAGT 


CTTTAAAACA 


TTTAATAAAA 


3563 


3573 


3583 


3593 


3603 


TAAATGCAAA 


ATATAGACTC 


CTTAACTAAA 


CTGACTATCT 


ATGGAGCCTC 


3613 


3623 


3633 


3643 


3653 


TAATTGATAA 


AGATGGAAGT 


CGGGACAAGA 


CCACGACATC 


CTGACTAAAC 


3663 


3673 


3683 


3693 


3703 


TGAGAAGTAA 


ATAAAATCCC 


CCGGAAAAAA 


AGGAGCCTCA 


CCATGGCTAA 


3713 


3723 


3733 


3743 


3753 


CTCGAACTCG 


GGGATATATC 


AATGAAGCTC 


CTGTTGATGA 


TCTTGAAGAC 


3763 


3773 


3783 


3793 


3803 


ATGTCTCTGC 


ATCATCAAAA 


AGATGCAGGC 


CAAATGGCTC 


AGTACGTAAA 


3813 


3823 


3833 


3843 


3853 


ATGTACGAGT 


ATGTAAGGGA 


AATTCTAAAG 


TATAACATAA 


GCTTGATACT 


3863 


3873 


3883 


3893 


3903 


TGAATAAAAG 


GAAACATACT 


TACCTCTTTT 


CAACTCAACT 


CAAATTAAGA 


3913 


3923 


3933 


3943 


3953 


ATAAGATACT 


CAACTCAAAG 


ATTAGGTATT 


CAACGCAAAT 


ATGGCACTCT 


3963 


3973 


3983 


3993 


4003 


ACTCAATGAA 


GTACAAATTA 


ACTCAGGATA 


CTCGACTTAA 


GATACTCAAC 


4013 


4023 


4033 


4043 


4053 


TCCCGACACT 


CAACTGAACT 


CATTTCAATA 


TAAAGCAGCT 


TAAAACAAGT 


4063 


4073 


4083 


4093 


4103 


TCAGTATAAA 


GTAAAGTTGT 


TTAAAAACAT 


GATGTCAACT 


CTGTGTGTAT 


4113 


4123 


4133 


4143 


4153 


AATAAGGGAT 


ACAACATAAC 


TTTGAAATGT- ATATAAAAAT 


ACAATTAACT 



FIGURE 7D 



4163 


4173 


4183 


4193 


4203 


GATGTATATA 


AAAATACATT 


AATCTATGGG 


AGATTCTCTA 


ACCGACAACC 


4213 


4223 


4233 


4243 


4253 


ATCACTTAAG 


GGCTAAGATG 


ATGATATAGC 


GATCTACCGC 


ACGCTGCCAT 


4263 


4273 


4283 


4293 


4303 


CGCATCTTAT 


ACCCGGCCAA 


AGGTATAAGA 


CCTGAACTGC 


CTAATGAATC 


4313 


4323 


4333 


4343 


4353 


CACTAATAAA 


CTGTTAAAAG 


GAATCATCTA 


AAAAGTATGA 


CCCTTTTCTA 


4363 


4373 


4383 


4393 


4403 


CCCATAGTGG 


CTAACATGGT 


TTATGGGGGC 


TGTGAGTTAT 


CTGAACTCTC 


4413 


4423 


4433 


4443 


4453 


CCCCATATCG 


GTGCTCAATA 


CTACTCCAAA 


AAATATACTG 


CTCTTATGTT 


4463 


4473 


4483 


4493 


4503 


TAAAAACATA 


CTGATTCTGT 


GGTTTGAAAT 


TATTGCTTAA 


AGCTTAGATT 


4513 


4523 


4533 


4543 


4553 


TTTGAAAAGC 


TCTCTTTTGA 


AAATCGTAGT 


TTCCTTTTTC 


TTCTATTAAA 


4563 


4573 


4583 


4593 


4603 


GCTAGACATA 


GGCTATGTAG 


AACTCTAGCT 


TACCTTCCTT 


CTCAAAAGTT 


4613 


4623 


4633 


4643 


4653 


TGAAAACATT 


TGCTTAGATT 


CTTAGGGACT 


ACTTAGTTCC 


CTTGTTGGAA 



TTC 



FIGURE7E 



PG GENOMIC 



10 20 30 40 50 

AAGCTTCTTA AAAAGGCAAA. TTGATTAATT TGAAGTCAAA ATAATTAATT 

60 70 80 90 100 

ATAACAGTGG TAAAGCACCT TAAGAAACCA TAGTTTGAAA GGTTACCAAT 

110 120 130 140 150 

GCGCTATATA TTAATCAACT TGATAATATA AAAAAAATTT CAATTCGAAA 

160 170 180 190 200 

AGGGCCTAAA ATATTCTCAA AGTATTCGAA ATGGTACAAA ACTACCATCC 

210 220 230 240 250 

GTCCACCTAT TGACTCCAAA ATAAAATTAT TATCCACCTT TGAGTTTAAA 

260 270 280 290 300 

ATTGACTACT TATATAACAA TTCTAAATTT AAACTATTTT AATACTTTTA 

310 320 330 340 350 

AAAATACATG GCGTTCAAAT ATTTAATATA ATTTAATTTA TGAATATCAT 

360 370 380 390 400 

TTATAAACCA ACCAACTACC AACTCATTAA TCATTAAATC CCACCCAAAT 

410 420 430 440 450 

TCTACTATCA AAATTGTCCT AAACACTACT AAAACAAGAC GAAATTGTTC 

460 470 480 490 500 

GAGTCCGAAT CGAAGCACCA ATCTAATTTA GGTTGAGCCG CATATTTAGG 

510 520 530 540 550 

AGGACACTTT CAATAGTATT TTTTTCAAGC ATGAATTTGA AATTTAAGAT 

560 570 580 590 600 

TAATGGTAAA GAAGTAGTAC ATCCCGAATT AATTCATGCC TTTTTTAAAT 

610 620 630 640 650 

ATAATTATAT AAATATTTAT GATTTGTTTT AAATATTAAA ACTTGAATAT 

660 670 680 690 700 

ATTATTTTTT TAAAAATTAT CTATTAAGTA CCATCACATA ATTGAGACGA 

710 720 730 740 750 

AGGAATAATT AAGATGAACA TAGTGTTTAA TTAGTAATGG ATGGGTAGTA 



FIGURE 8A 



760 770 780 790 800 

AATTTATTTA TAAATTATAT CAATAAGTTA AATTATAACA AATATTTGAG 

810 820 830 840 850 

CGCCATGTAT TTTAAAAAAT ATTAAATAGT TTGAATTTAA AACCGTTAGA 

860 870 880 890 900 

TAAATGGTCA ATTTTGAACC CAAAAGTGGA TGAGAAGGGT ATTTTAGAGC 

910 920 930 940 950 

CAATAGGRGG ATGAGAAGGA TATTTTGAAG CCAATATGTG ATGGATGAAG 

960 970 980 990 1000 

GATAATTTTG TATCATTTCT AATACTTTAA AGATATTTTA GGTCATTTTC 

1010 1020 1030 1040 1050 

CCTTCTTTAG TTTATAGACT ATAGTGTTAG TTCATCGAAT ATCATCTATT 

1060 1070 1080 1090 1100 

ATTTCCGTCT TAAATTATTT TTTATTTTAT AAATTTTTTA AAAATAAATT 

1110 1120 1130 1140 1150 

ATTTTTTCCA TTTAACTTTG ATTGTAATTA ATTTTTAAAA ATTACCAACA 

1160 1170*" / 1180 1190 1200 

TATAAATAAA ATTAATATTT AACAAAGAAT TGTAACATAA TATTTTTTTA 

1210 1220 1230 1240 1250 

ATTATTCAAA ATAAATATTT TTAAACATCA TATAAAAGAA ATACGACAAA 

1260 1270 1280 1290 1300 

AAAATTGAGA CGGGAGAAGA CAAGCCAGAC AAAAATGTCC AAGAAACTCT 

1310 1320 1330 1340 1350 

TTCGTCTAAA TATCTCTCAT CCAAACTAAT ATAATACCCA TTATAATTAA 

1360 1370 1380 1390 1400 

CCATATTGAC CAACTCAAAC CCCTTAAAAT CTATAAATAG ACAAACCCTT 

1410 1420 1430 1440 1450 

CCCATACCTC TTATCATAAA AAAAATAATA ATCTTTTTCA ATAGACAAGT 

1460 1470 1480 1490 1500 

TTAAAAACCA TACCATATAA CAATATATCA TGGTTATCCA AAGGAATAGT 



FIGURE 8B 



1510 


1520 


1530 


1540 


1550 


ATTCTCCTTC 


TCATTATTAT 


TTTTGCTTCA 


TCAATTTCAA 


CTTGTAGAAG 


1560 


1570 


1580 


1590 


1600 


CAATGTTATT 


GATGACAATT 


TATTCAAACA 


AGTTTATGAT 


AATATTCTTG 


1510 


1620 


1530 


1640 


1650 


AACAAGAATT 


TGCTCATGAT 


TTTCAAGCTT 


ATCTTTCTTA 


TTTGAGCAAA 


1660 


1670 


1680 


1690 


1700 


AATATTGAAA 


GCAACAATAA 


TATTGACAAG 


GTTGATAAAA 


ATGGGATTAA 


1710 


1720 


1730 


1740 


1750 


AGTGATTAAT 


GTACTTAGCT 


TTGGAGCTAA 


GGGTGATGGA 


AAAACATATG 


1760 


1770 


1780 


1790 


1800 


ATAATATTGT 


AAGTATTTAA 


ATATTGGAAT 


ATATTTGTGG 


GGATGAAAAT 


1810 


1820 


1830 


1840 


1850 


GATAGAGAAT 


ATAAGAATTA 


TTTGGAAGGA 


TGAAAAGTTA 


TATTTTATAA 


1860 


1870 


1880 


1890 


1900 


AGTAGAAAAT 


TATTTTCTCG 


TTTTTAGTAA 


TTAAAGGTGA 


AAAATGAGTT 


1910 


1920 


1930 


1940 


1950 


TTCTCGTAAG 


CGAGGAAAGT 


CATTTTCCAT 


GGAACTGTAT 


TTTTTTTTTA 


1960 


1970 


1980 


1990 


2000 


CTTTTAATAA 


CGTCATAGTA 


TTTGCTATAC 


TCAAGAATAA 


GACACTATTA 


2010 


2020 


2030 


2040 


2050 


TTGATGTTTA 


GTGCTCGAAA 


AGAAATTGAT 


AGTAATTTTG 


CTAATATAAC 


2060 


2070 


2080 


2090 


2100 


TATCAATTTC 


TTATATGTAT 


ATTTTTCAAC 


CAAAATAACA 


AAGCGTAATC 


211U 






0 1 A(\ 
C 14U 


/clou 


CAATAAGTGG 


GCCTCTAGAA 


TAAAGAGTAA 


GTTCTATTAA 


TTCTTAACCT 


2160 


2170 


2180 


2190 


2200 


TATTTAATTT 


TATGGAAACC 


TCGACAAAAC 


GACAATGCTC 


AACTTATATT 



CGAATTC 



FIGURE 8C 



ScoRI 
Hindlll 



I 



1 

J 



Hindlll 
Hindlll 



potyA 

XGA 
EcoRI 



Hindlll 



EcoRI 



FIGURE 9 




FIGURE 10 



r- 




©FY 



CGNE-99-2 



APPLICATION FOR UNITED STATES LETTERS PATENT 
DECLARATION AND POWER OF ATTORNEY 



As a below named inventor, I hereby declare that my residence, post office address and 
citizenship are as stated next to my name. 

I believe I am the original, first and sole inventor (if only one name is listed below) or an 
original, first and joint inventor (if plural names are listed below) of the subject matter which is 
claimed and for which a patent is sought on the invention entitled: 

METHODS AND COMPOSITIONS FOR REGULATED TRANSCRIPTION 
AND EXPRESSION OF HETEROLOGOUS GENES 

and which application was filed in the United States Patent and Trademark Office on March 7, 1997, 
having Attorney Docket No. CGNE 99-2, and the Serial Number designation 08/812,665, which 
application is a continuation of U.S.S.N. 08/484,941, filed June 7, 1995, which is a continuation of 
U.S.S.R 08/105,852, filed 8/10/93, pending; U.S.S.N. 08/105,852 is a continuation in part of 
07/526,123, filed 5/21/90, pending, which is a continuation of 07/267,865, filed 11/2/88, 
abandoned, which is a continuation of 06/692,605, filed 1/17/85, abandoned; U.S.S.N. 08/105,852, 
is also a continuation in part of 07/582,241 , filed 9/14/90, abandoned, which is a continuation of 
07/188,361, filed 4/29/88, abandoned, which is a continuation in part of 07/168,190, filed 3/15/88, 
abandoned, which is a continuation in part of 07/054,369, filed 5/26/87, which issued on 7/24/90 as 
patent number 4,943,674; U.S.S.N. 08/105,852 is also a continuation in part of U.S.S.N. 
07/742,834, August 8, 1991, which issued as U.S. Patent No. 5,420,034 issued on 5/30/95, which 
is a continuation in part of 07/550,804, filed 7/9/90, abandoned, which is a continuation in part of 
07/147,78 1 , filed 1/25/88, abandoned, which is a continuation in part of 07/078,538, filed 7/28/87, 
abandoned, which is a continuation in part of 06/891,529, filed 7/31/86, which is abandoned., 

I hereby state that I have reviewed and understand the contents of the above-identified 
application, including the claims, and including any amendments filed concurrently with the 
application papers. 

I acknowledge the duty to disclose information which is material to the examination of this 
application in accordance with Title 37, Code of Federal Regulations, Section 1.56. 

I hereby claim benefit under the Paris Convention and 35 USC 1 19 of the priority of the 
following previously filed application(s): 

Country Serial Number Filing Date 

No application to the invention of the present application was filed in any foreign country prior to the 
above application(s). 

I hereby claim the benefit under Title 35, United States Code, 120 of each United States 
application listed below and, insofar as the subject matter of each of the claims of this application is 
not disclosed in the prior United States application in the manner provided by the first paragraph of 
Title 35, United States Code, 112, 1 acknowledge the duty to disclose material information as defined 
in the Title 37, Code of Federal Regulations, 1.56 which occurred between the filing date of the prior 
application and the national or PCT international filing date of this application. 



1 



Prior U.S. Application(s) 



Serial No 



Filing Date 



08/484,941 
08/105,852 
07/526,123 
07/267,865 
06/692,605 
07/582,241 
07/188,361 
07/168,190 
07/054,369 
07/742,834 
07/550,804 
07/147,781 
07/078,538 
06/891,529 



6/7/95 
8/10/93 
5/21/90 
11/2/88 
1/17/85 
9/14/90 
4/29/88 
3/15/88 
5/26/87 
8/8/91 
7/9/90 
1/25/88 
7/28/87 
7/31/86 



I hereby appoint 

Donna E. Scherer, Reg. No. 34,719 
Carl J. Schwedler, Reg. No. 36,924 

my attorney of record/agent with full power of substitution and recovation to prosecute this 
application and to transact all business in the Patent Office. 

All further correspondence should be addressed to: 



I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these statements 
were made with the knowledge that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that such 
willful false statements may jeopardize the validity of the application or any patent issued thereon. 

Full Name of Inventor: Vic C. Knauf 



Calgene, Inc. 
1920 Fifth Street 
Davis, CA 95616 



Telephone: (916)753-6313 



Inventor's Signature: 
Residence and Post: 
Office Address 
Citizenship: 




Winters, CA 95694 
USA 



Full Name of Inventor: 




Jean C. Kridl 



Inventor's Signature: 
Residence and Post: 
Office Address 
Citizenship: 



Davis, C A 956 16 
USA 



